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ADENO-ASSOCIATED VIRUSES AND USES THEREOF 

Statement of Government Rights 

This invention was made at least in part with a grant from the Government of 
the United States of America (grant HL51887 from the National Institutes of 
Health). The Government may have certain rights in the invention. 



Claim of Priority 

This application claims priority under 35 U.S.C. 1 19(e) from U.S. 
Provisional Application Serial No. 60/158,209, filed October 7, 1999. 



Co-Pending Application 

This invention is related to the following invention which is assigned to the 
same assignee as the present invention: 

U.S. Application Serial No. 09/276,625, filed March 25, 1999, titled 
"ADENO-ASSOCIATED VIRUS VECTORS" (Atty. Docket No. 875.007US1). 



Background of the Invention 

Adeno-associated virus (AAV) is a non-pathogenic parvovirus with a 
single-stranded DNA genome of 4680 nucleotides. The genome may be of either 
plus or minus polarity, and codes for two groups of genes, Rep and Cap (Bems et 
al, 1990). Inverted terminal repeats (ITRs), characterized by palindromic sequences 
producing a high degree of secondary structure, are present at both ends of the viral 
genome. While other members of the parvovirus group replicate autonomously, 
AAV requires co-infection with a helper virus (i.e., adenovirus or herpes virus) for 
lytic phase productive replication. In the absence of a helper virus, wild-type AAV 
(wtAAV) establishes a latent, non-productive infection with long-term persistence 
by integrating into a specific locus on chromosome 19, AAVS1, of the host genome 
through a Rep-facilitated mechanism (Samulski, 1993; Linden et al., 1996; Kotin et 
al., 1992). 
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In contrast to wtAAV, the mechanism(s) of latent phase persistence of 
recombinant AAV (rAAV) is less clear. rAAV integration into the host genome is 
not site-specific due to deletion of the AAV Rep gene (Ponnazhagan et al., 1997). 
Analysis of integrated pro viral structures of both wild type and recombinant AAV 
5 have demonstrated head-to-tail genomes as the predominant structural forms. 

rAAV has recently been recognized as an extremely attractive vehicle for 
gene delivery (Muzyczka, 1992). rAAV vectors have been developed by 
substituting all viral open reading frames with a therapeutic minigene, while 
retaining the cis elements contained in two inverted terminal repeats (ITRs) 
10 (Samulski et al, 1987; Samulski et al., 1989). Following transduction, rAAV 

genomes can persist as episomes (Flotte et al., 1994; Afione et al., 1996; Duan et al., 
j»% 1998), or alternatively can integrate randomly into the cellular genome (Berns et al., 

^3 1996; McLaughlin et al., 1988; Duan et al., 1997; Fisher-Adams et al., 1996; Kearns 

et al., 1996; Ponnazhagan et al., 1997). However, little is known about the 
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1 5 mechanisms enabling rAAV vectors to persist in vivo or the identity of cellular 
factors which may modulate the efficiency of transduction and persistence. 
Although transduction of rAAV has been demonstrated in vitro in cell culture 
(Muzyczka, 1992) and in vivo in various organs (Kaplitt et al., 1994; Walsh et al., 
1994; Conrad et al., 1996; Herzog et al, 1997; Snyder et al., 1997), the mechanisms 

20 of rAAV-mediated transduction remain unclear. 

Moreover, while rAAV has been shown to be capable of stable, long-term 
transgene expression both in vitro and in vivo in a variety of tissues, the transduction 
efficiency of rAAV is markedly variable in different cell types. For example, rAAV 
has been reported to transduce lung epithelial cells at low levels (Halbert et al., 

25 1997; Duan et al., 1998a), while high level, persistent transgene expression has been 
demonstrated in muscle, neurons and in other non-dividing cells (Kessler et al, 
1996; Fisher et al., 1997; Herzog et al., 1997; Xiao et al., 1996; Kaplitt et al., 1994; 
Wu et al., 1998; Ali et al, 1996; Bennett et al., 1997 Westfall et al., 1997). These 
tissue-specific differences in rAAV mediated gene transfer may, in part, be due to 

30 variable levels of cellular factors affecting AAV infectivity (i.e., receptors and co- 
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receptors such as heparin sulfate proteoglycan, FGFR-1, and aVp5 integrin) 
(Summerford et al., 1998; Qing et al., 1999; Summerford et al., 1999) as well as the 
latent life cycle (i.e., nuclear trafficking of virus and/or the conversion of single 
stranded genomes to expressible forms) (Qing et al., 1997; Qing et al., 1998). 

Muscle-mediated gene transfer represents a very promising approach for the 
treatment of hereditary myopathies and several other metabolic disorders. Previous 
studies have demonstrated remarkably efficient and persistent transgene expression 
to skeletal muscle in vivo with rAAV vectors. Applications in this model system 
include the treatment of several inherited disorders such as Factor IX deficiency in 
hemophilia B and epo deficiencies (Kessler et al., 1996; Herzog et al., 1997). 
Although the conversion of low-molecular- weight rAAV genomes to high- 
molecular-weight concatamers has been inferred as evidence for integration of 
proviral DNA in the host genome, no direct evidence exists in this regard (Xiao et 
al., 1996; Clark et al., 1997; Fisher et al. 1997). Also, the molecular processes 
and/or structures associated with episomal long-term persistence of rAAV genomes, 
e.g., in nondividing mature myofibers, remains unclear. 

In addition, due to limitations in rAAV vector packaging capacity, a rAAV 
vector may not be useful if large regulatory elements are needed to control transgene 
expression. 

Thus, there is a need for rAAV vectors that have increased stability and/or 
persistence in host cells. Moreover, there is a need for vectors useful to express 
large open reading frames. 

Summary of the Invention 

The present invention provides a recombinant adeno-associated virus 
(rAAV) vector comprising a nucleic acid segment formed by the juxtaposition of 
sequences in the AAV inverted terminal repeats (ITRs) which are present in a 
circular intermediate of AAV. The circular intermediate was isolated from rAAV- 
infected cells by employing a recombinant AAV "shuttle" vector. The shuttle 
vector comprises: a) a bacterial origin of replication; b) a marker gene or a selectable 



gene; c) a 5' ITR; and d) a 3' ITR. Preferably, the recombinant AAV shuttle vector 
contains a reporter gene, e.g., a GFP, alkaline phosphatase or p-galactosidase gene, 
a selectable marker gene, e.g., an ampicillin-resistance gene, a bacterial origin of 
replication, a 5' ITR and a 3' ITR. The vector is contacted with eukaryotic cells so 
as to yield transformed eukaryotic cells. Low molecular weight DNA ("Hirt DNA") 
from the transformed eukaryotic cells is isolated. Bacterial cells are contacted with 
the Hirt DNA so as to yield transformed bacterial cells. Then bacterial cells are 
identified which express the marker or selectable gene present in the shuttle vector 
and which comprise at least a portion of a circular intermediate of adeno-associated 
virus. Also, as described below, it was found that circularized intermediates of 
rAAV impart episomal persistence to linked sequences in Hela cells, fibroblasts and 
muscle cells. In HeLa cells, the incorporation of certain AAV sequences, e.g., ITRs, 
from circular intermediates into a heterologous plasmid conferred a 10-fold increase 
in the stability of plasmid-based vectors in HeLa cells. Unique features of these 
transduction intermediates included the in vivo circularization of a head-to-tail 
monomer as well as multimer (concatamers) episomal viral genomes with associated 
specific base pair alterations in the 5' viral D-sequence. The majority of circular 
intermediates had a consistent head-to-tail configuration consisting of monomer 
genomes (<3 kb) which slowly converted to large multimers of >12 kb by 80 days 
post-infection in muscle. Importantly, long-term transgene expression was 
associated with prolonged (80 day) episomal persistence of these circular 
intermediates. Thus, in vivo persistence of rAAV can occur through episomal 
circularized genomes which may represent prointegration intermediates with 
increased episomal stability. Moreover, as described below, co-infection with 
adenovirus, at high multiplicities of infection (MOI) capable of producing early 
adenoviral gene products, led to increases in the abundance and stability of AAV 
circular intermediates which correlated with an elevation in transgene expression 
from rAAV vectors. Thus, these results demonstrate the existence of a molecular 
structure involved in AAV transduction which may play a role in episomal 
persistence and/or integration. 



Further, these results may aid in the development of non- viral or viral-based 
gene delivery systems having increased efficiency. For example, therapeutic or 
prophylactic therapies in which the present vectors are useful include blood 
disorders (e.g., sickle cell anemia, thalassemias, hemophilias, and Fanconi anemias), 
neurological disorders, such as Alzheimer's disease and Parkinson's disease, and 
muscle disorders involving skeletal, cardiac or smooth muscle. In particular, , 
therapeutic genes useful in the vectors of the invention include the p-globin gene, 
the y-globin gene, the cystic fibrosis transmembrane conductance receptor gene 
(CFTR), the erythropoietin (epo) gene, the Fanconi anemia complementation group, 
a gene encoding a ribozyme, an antisense gene, a low density lipoprotein (LDL) 
gene, a tyrosine hydroxylase gene (Parkinson's disease), a glucocerebrosidase gene 
(Gaucher' s disease), an arylsulfatase A gene (metachromatic leukodystrophies) or 
genes encoding other polypeptides or proteins. Also within the scope of the 
invention is the inclusion of more than one gene in a vector of the invention, i.e., a 
plurality of genes may be present in an individual vector. Further, as a circular 
intermediate may be a concatamer, each monomer of that concatamer may comprise 
a different gene, or a portion thereof. 

For viral-based delivery systems, helper-free virus can be prepared (see WO 
95/13365) from circular intermediates or vectors of the invention. Alternatively, 
liposomes, plasmid or virosomes may be employed to deliver a vector of the 
invention to a host or host cell. 

The increased persistence of circular intermediates or vectors having one or a 
plurality of ITRs may be due to the primary and/or secondary structure of the ITRs. 
The primary structure of a consensus sequence (SEQ ID NO:3) of ITRs formed by 
the juxtaposition and physical (phosphodiester bond) linkage of ITRs from AAV is 
shown in Figure 2C. However, as described hereinbelow, each ITR sequence may 
be incomplete, i.e., the ITR may be a subunit or portion of the full length ITRs 
present in the consensus sequence. Moreover, preferably, an isolated DNA segment 
of the invention is not the 165 bp double DD sequence (SEQ ID NO:7) disclosed in 
U.S. Patent No. 5,478,745, referred to as a "double sequence". 



Moreover, the formation, persistence and/or abundance of molecules having 
the ITR sequences of the invention may be modulated by helper virus, e.g., 
adenoviral proteins and/or host cell proteins. Thus, the circular intermediates or 
vectors of the invention may be useful to identify and/or isolate proteins that bind to 
5 the ITR sequences present in those molecules. 

Therefore, the present invention provides an isolated and purified DNA 
molecule comprising at least one DNA segment, a biologically active subunit or 
variant thereof, of a circular intermediate of adeno-associated virus, which DNA 
segment confers increased episomal stability, persistence or abundance of the 
10 isolated DNA molecule in a host cell. Preferably, the DNA molecule comprises at 
least a portion of a left (5') inverted terminal repeat (ITR) of adeno-associated virus. 
Also preferably, the DNA molecule comprises at least a portion of a right (3')- 
inverted terminal repeat of adeno-associated virus. The invention also provides a 
£0 gene transfer vector, comprising: at least one first DNA segment, a biologically 

Ifl 15 active subunit or variant thereof, of a circular intermediate of adeno-associated 

~j virus, which DNA segment confers increased episomal stability or persistence of the 

e vector in a host cell; and a second DNA segment comprising a gene. Preferably, the 

it 

5" — 

q second DNA segment encodes a therapeutically effective polypeptide. The first 

*~ DNA segment comprises ITR sequences, preferably at least about 100, more 

CP 

Q 20 preferably at least about 300, and even more preferably at least about 400, bp of 

~* adeno-associated virus sequence. A preferred vector of the invention is a plasmid. 

Thus, the vector of the invention is useful in a method of delivering and/or 
expressing a gene in a host cell, to prepare host cells having the vector(s), and in the 
preparation of compositions comprising such vectors. To deliver the gene to the 
25 host cell, a recombinant adenovirus helper virus may be employed. 

The implications of intermolecular recombination of rAAV genomes to form 
a single circular episome, which may be a circular concatamer comprising at least 
two different rAAV genomes, is particularly relevant for gene therapy with rAAV. 
First, large regulatory elements and genes beyond the packaging capacity of rAAV 
30 can be brought together by co-infecting tissue with two independent vectors. For 
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example, enhancers and/or promoters may be introduced into one vector while DNA 
comprising an open reading frame, e.g., a gene of interest, with or without a 
minimal promoter, is introduced into a second vector. Thus, after co-infection with 
the two vectors, the transgene cassette size is increased beyond that for a single 
5 AAV vector alone and the DNA comprising the opening reading frame is linked to 
the enhancer and/or promoter. In another embodiment, of the invention, vectors 
encoding two independent regions of a gene are brought together to form an intact 
splicing unit by circular concatamerization. In a further embodiment of the 
invention, a vector comprising an origin of replication and a DNA encoding a 
10 protein that binds to the origin and promotes replication and/or maintenance of DNA 
that is linked to the origin, and a vector comprising a gene of interest are brought 
p together after co-infection to form an autonomously replicating episome comprising 

the gene. 

As described hereinbelow, the tibialis muscle of mice was co-infected with 
15 rAAV Alkaline phosphatase (Alkphos) and GFP encoding vectors. The GFP shuttle 
>S vector also encoded ampicillin resistance and a bacterial origin of replication to 

3 allow for bacterial rescue of circular intermediates in Hirt DNA from infected 

muscle samples. There was a time dependent increase in the abundance of rescued 
plasmids encoding both GFP and Alkphos that reached 33% of the total circular 
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□ 20 intermediates by 120 days post-infection. Furthermore, these large circular 

Q 

concatamers were capable of expressing both GFP and Alkphos encoded transgenes 
following transient transfection in cell lines. Thus, concatamerization of AAV 
genomes in vivo occurs through intermolecular recombination of independent 
monomer circular viral genomes. Therefore, a plurality of DNA segments, each in 

25 an individual rAAV vector, may be delivered so as to result in a single DNA 

molecule having a plurality of the DNA segments. For example, one rAAV vector 
comprises a first DNA segment comprising a 5' ITR linked to a second DNA 
segment comprising a promoter operably linked to a third DNA segment comprising 
a first open reading frame linked to a fourth DNA segment comprising a 3 ' ITR. A 

30 second rAAV vector comprises a first DNA segment comprising a 5' ITR linked to 
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a second DNA segment comprising a promoter operably linked to a third DNA 
segment comprising a second open reading frame linked to a fourth DNA segment 
comprising a 3' ITR. 

In another embodiment, one rAAV vector comprises a first DNA segment 
comprising a 5 ' ITR linked to a second DNA segment comprising a promoter 
operably linked to a third DNA segment comprising the 5 ' end of an open reading 
frame linked to fourth DNA segment comprising a 5' splice site linked to a fifth 
DNA segment comprising a 3 ' ITR. The second rAAV vector comprises a first 
DNA segment comprising a 5' ITR linked to a second DNA segment comprising a 
3' splice site linked to a third DNA segment comprising the 3' end of the open 
reading frame linked to a fourth DNA segment comprising a 3' ITR. Preferably, the 
second and third DNA segments together comprise DNA encoding, for example, 
CFTR, factor VIII, dystrophin, or erythropoietin. Also preferably, the second DNA 
segment comprises the endogenous promoter of the respective gene, e.g., the epo 
promoter. 

Thus, the invention provides a composition comprising: a first adeno- 
associated virus vector comprising linked DNA segments and at least a second 
adeno-associated virus comprising linked DNA segments. The linked DNA 
segments of the first vector comprise: a first DNA segment comprising a 5' ITR; a 
second DNA segment comprising at least a portion of an open reading frame 
operably linked to a promoter, wherein the DNA segment does not comprise the 
entire open reading frame; a third DNA segment comprising a splice donor site; and 
iv) a fourth DNA segment comprising a 3' ITR. The linked DNA segments of the 
second vector comprise a first DNA segment comprising a 5 ' ITR; a second DNA 
segment comprising a splice acceptor site; a third DNA segment comprising at least 
a portion of an open reading frame which together with the second DNA segment of 
the first vector encodes a full-length polypeptide; and a fourth DNA segment 
comprising a 3' ITR. Preferably, the second DNA segment of the first vector 
comprises a first exon of a gene comprising more than one exon and the third DNA 
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segment of the second vector comprises at least one exon of a gene that is not the 
first exon. 

The invention also provides a method to transfer and express a polypeptide 
in a host cell. The method comprises contacting the host cell with at least two 
5 rAAV vectors. One rAAV vector comprises a first DNA segment comprising a 5 
'ITR linked to a second DNA segment comprising a promoter operably linked to a 
third DNA segment comprising a first open reading frame linked to a fourth DNA 
segment comprising a 3 ' ITR. A second rAAV vector comprises a first DNA 
segment comprising a 5' ITR linked to a second DNA segment comprising a 
10 promoter operably linked to a third DNA segment comprising a second open reading 
frame linked to a fourth DNA segment comprising a 3 'ITR. Alternatively, one 
rAAV vector comprises a first DNA segment comprising a 5 'ITR linked to a 
second DNA segment comprising a promoter operably linked to a third DNA 
segment comprising the 5 ' end of an open reading frame linked to fourth DNA 
Ul 15 segment comprising a 5 ' splice site linked to a fifth DNA segment comprising a 3 ' 

sj jj 

ITR. The second rAAV vector comprises a first DNA segment comprising a 5' 
3 ITR linked to a second DNA segment comprising a 3' splice site linked to a third 

p DNA segment comprising the 3' end of the open reading frame linked to a fourth 

DNA segment comprising a 3 ITR. The host cell is preferably contacted with both 
20 of the vectors, concurrently, although it is envisioned that the host cell may be 
contacted with each vector at a different time relative to the contact with the other 
vector(s). 

Also provided is a method in which the composition of the invention is 
administered to the cells or tissues of an animal. For example, rAAV vectors have 
25 shown promise in transferring the CFTR gene into airway epithelial cells of animal 
models and nasal sinus of CF patients. However, high level expression of CFTR 
has not been achieved due to the fact that AAV cannot accommodate the full-length 
CFTR gene together with a potent promoter. A number of studies have tried to 
optimize rAAV-mediated CFTR expression by utilizing truncated or partially 
30 deleted CFTR genes together with stronger promoters. However, it is currently 
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unknown what effect deletions within the CFTR gene may have on 
complementation of bacterial colonization defects in the CF airway. Therefore, the 
present invention includes the administration to an animal of a composition of the 
invention comprising at least two rAAV vectors which together encode CFTR. The 
present invention is useful to overcome the current size limitation for transgenes 
within rAAV vectors, and allows for the incorporation of a larger transcriptional 
regulatory region, e.g., a stronger heterologous promoter or the endogenous CFTR 
promoter. 

As described hereinbelow, transgene expression from rAAV luciferase 
vectors, with or without a promoter, can be greatly enhanced by co-infection with an 
independent rAAV vector carrying the cytomegalovirus (CMV) and simian virus 40 
(S V40) enhancers. Thus, co-infection with a transgene containing vector and a 
second vector comprising at least one, preferably at least two or more, enhancer 
sequences, of cell lines and muscle in vivo resulted in a greater than 600-fold 
enhancement of transgene expression from a minimal SV40 promoter. Furthermore, 
200-fold enhancement was also achieved by c/s-activation of ITRs in transgene 
containing vectors without a promoter. Thus, large regulatory elements including 
tissue specific enhancers can be introduced into cells by a separate rAAV to regulate 
the expression of a second transgene containing vector in cis following intracellular 
concatamerization. 

Thus, the invention provides a composition comprising at least two 
recombinant AAV genomes. The composition comprises a first recombinant AAV 
comprising a first recombinant DNA molecule comprising linked: i) a first DNA 
segment comprising a 5 '-inverted terminal repeat of AAV; ii) a second DNA 
segment which does not comprise AAV sequences; and iii) a third DNA segment 
comprising a 3 ' -inverted terminal repeat of AAV; and comprises a second 
recombinant AAV comprising a second recombinant DNA molecule comprising 
linked: i) a first DNA segment comprising a 5 '-inverted terminal repeat of AAV; ii) 
a second DNA segment which does not comprise AAV sequences and which second 
DNA segment is different than the second DNA segment of the first recombinant 
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DNA molecule; and iii) a third DNA segment comprising a 3 '-inverted terminal 
repeat of AAV. The composition of the invention, is preferably contacted with a 
mammalian host cell, e.g., a murine, canine, feral or human cell. Alternatively, a 
host cell may be contacted with each recombinant AAV individually, e.g., 
5 sequentially. 

Thus, in one embodiment of the invention, a host cell is contacted with at 
least two recombinant AAV genomes. A first recombinant AAV comprises a first 
recombinant DNA molecule comprising linked:i ) a first DNA segment comprising 
a 5 '-inverted terminal repeat of AAV; ii) a second DNA segment which does not 
10 comprise AAV sequences; and iii) a third DNA segment comprising a 3 '-inverted 
terminal repeat of AAV. A second recombinant AAV comprises a second 
recombinant DNA molecule comprising linked: i) a first DNA segment comprising 
a 5 '-inverted terminal repeat of AAV; ) a second DNA segment which does not 
comprise AAV sequences and which second DNA segment is different than the 



Ul 15 second DNA segment of the first recombinant DNA molecule; and iii) a third DNA 

IH 

1* segment comprising a 3 '-inverted terminal repeat of AAV. 

p recombinant DNA molecule comprises a portion of an open reading frame, e.g., an 



In one embodiment of the invention, the second DNA segment of the first 



exon of a multi-exon gene, operably linked to a promoter. For example, the 
20 promoter may be the endogenous promoter for the gene corresponding to the open 
reading frame. Preferably, the second DNA segment of the second recombinant 
DNA molecule comprises the remainder of the open reading frame which together 
with the second DNA segment of the first recombinant DNA molecule encodes a 
full-length polypeptide. Also preferably, the first recombinant DNA molecule 
25 comprises a splice donor site 3' to the open reading frame, and the second DNA 
segment of the second recombinant DNA molecule comprises a splice acceptor site 
5' to the remainder of the open reading frame. 

In another embodiment of the invention, the second DNA segment of the 
first recombinant DNA molecule comprises at least one heterologous enhancer 
30 and/or at least one heterologous promoter, i.e., the enhancer and/or promoter 
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sequences are not derived from AAV sequences. Preferably, the second DNA 
segment of the second recombinant DNA molecule comprises at least a portion of 
an open reading frame. 

In yet a further embodiment of the invention, the second DNA segment of 
the first recombinant DNA molecule comprises an origin of replication 
functional in a host cell, e.g., a viral origin of replication such as OriP. Preferably, 
the origin is functional in a human cell. Also preferably, the second DNA segment 
of the first recombinant DNA molecule further comprises DNA encoding a protein 
that binds to the origin of replication, e.g., EBNA-1. The second DNA segment in 
the second recombinant DNA molecule comprises at least a portion of an open 
reading frame, and preferably a promoter operably linked to the open reading frame. 

In yet another embodiment of the invention, the second DNA segment of the 
first recombinant DNA molecule comprises a as-acting integration sequence(s) for 
a recombinase and also encodes a recombinase or integrase that is specific for the 
integration sequence(s), e.g., Cre/lox system of bacteriophage PI (U.S. Patent No. 
5,658,772), the FLP/FRT system of yeast, the Gin recombinase of phage Mu, the 
Pin recombinase of E. coli, the R/RS system of the pSRl plasmid, a retrotransposase 
or the integrase from a lentivirus or retrovirus. The second DNA segment in the 
second recombinant DNA molecule comprises at least a portion of an open reading 
frame, and preferably a promoter operably linked to the open reading frame. The 
formation of a concatamer comprising the first and the second recombinant DNA 
molecules, and the expression of the recombinase or integrase, will enhance the 
integration of the concatamer, or a portion thereof, into the host genome. 

Brief Description of the Figures 

Figure 1. Structure of proviral shuttle vector and the predicted structure of 
rAAV circular intermediate monomers. With the aid of a rAAV c/s-acting plasmid, 
pCisAV.GFP3ori (Panel A), AV.GFP3ori recombinant virus was produced 
(Panel B). This vector encoded a GFP transgene cassette, an ampicillin resistance 
gene (amp), and a bacterial replication origin (ori). The predominant form of 
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circular intermediates isolated following transduction of Hela cells with 
AV.GFP3ori consisted of head- to-tail monomers (Panels C and D). 

Figure 2. Structural analysis of r AAV circular intermediates in Hela cells. 
Circular rAAV intermediate clones isolated from AV.GFP3ori infected Hela cells 
were analyzed by diagnostic restriction digestion with Asel, SphI, and PstI together 
with Southern blotting against ITR, GFP, and Stuffer 32 P-labeled probes. In panel 
A, four clones representing the diversity of intermediates found (pl90, p333, p280, 
and p345) gave a diagnostic PstI (P) restriction pattern (3 kb and 1.7 kb bands) 
consistent with a circular monomer or multimer intact genome [agarose gel (Left) 
and Southern blot (Right)]. SphI (S) digestion demonstrated existence of a single 
ITR (pi 90), two ITRs in a head-to-tail orientation (p333 and p280), and three ITRs 
(p345) in isolated circular intermediates. The restriction pattern of pCisAV.GFP3ori 
(U; uncut, P; PstI cut, S; SphI cut) and 1 kb DNA ladder (L) are also given for 
comparison. One additional circular form (p340) was repetitively seen and had an 
unidentifiable structure which lacked intact ITR sequences. Circular concatamers 
were identified by partial digestion with Asel for clones p280 (dimer) and p333 
(monomer) as is shown in Panel B. Sequence analysis (Panel C) of six clones (SEQ 
ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO:l 1, SEQ ID NO: 12, and SEQ 
ID NO: 13) with identical restriction patterns to p333 (Panel A) was performed using 
primers (indicated by arrows) juxtaposed to the partial p5 promoter (dotted line) and 
ITRs (solid line). The top sequence (SEQ ID NO:3) represents the proposed 
head-to-tail structure of intact ITR arrays with alignment of sequence derived from 
individual clones. The junction of the inverted ITRs is marked by inverted 
arrowheads (at 251 bp). Several consistent bp changes (shaded) were noted in the 
5 'ITR D-sequence (boxed) within four clones (p79, p81, p87, and p88). All bp 
changes are indicated in lower case letters. 

Figure 3. Adenovirus augments AAV circular intermediate formation in 
Hela cells. Infection of Hela cells with increasing doses (0, 500, and 5000 
particles/cell) of recombinant El -deleted adenovirus (Ad.CMVlacZ) leads to 
substantial expression of E2a 72kd DNA Binding Protein, as demonstrated by 
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indirect immunofluorescent staining for DBP at 72 hours post-infection (Panel A). 
Co-infection of Hela cells with Ad.CMVlacZ (5000 particles/cell) and AV.GFP3ori 
(1000 DNA particles/cell) led to substantial augmentation of rAAV GFP transgene 
expression (Panel B). Augmentation in rAAV GFP transgene expression in the 
presence of increasing amounts (0, 500, 5000 and 10000 particles/cell) of 
recombinant Ad.CMVlacZ was quantified by FACS analysis at 72 hour 
post-infection (Panel C). Results demonstrate the mean (+/-SEM) for two 
experiments performed in duplicate. In addition, an aliquot of cells was split (1:10) 
at the time of FACS analysis and GFP colony forming units (CFU) per 10X field 
were quantified at 6 days (CPE denotes significant cytopathic effects at an 
adenoviral MOI of 10,000 particles/cell and was not quantified for GFP colonies). 
Hirt DNAs from AV.GFP3ori (1000 DNA particles/cell) infected Hela cells with or 
without co-infection with Ad.CMVlacZ (5,000 particles/cell) were used to transform 
E. coli. The total number of ampicillin-resistant bacterial CFU (Panel D) and total 
number of head- to-tail circular intermediates CFU (Panel E) are given for a 
representative experiment. Greater than 20 clones for each time point were 
evaluated by Southern blot (see Figure 2 for detail). Zero hour controls were 
performed by mixing an equivalent amount of AV.GFP3ori virus as used in 
experiments with mock infected cellular lysates prior to Hirt purification. Panel F 
depicts the abundance of head-to-tail circular intermediates as a percentage of total 
ampicillin-resistant bacterial CFU isolated from Hirt DNA. 

Figure 4. Formation of rAAV head-to-tail circular intermediates following 
in vivo transduction of muscle. The tibialis anterior muscle of 4-5 week old 
C57BL/6 mice were infected with AV.GFP3ori (3 X 1010 particles) in HEPES 
buffered saline (30 |il). GFP expression (Panel A) was analyzed by direct 
immunofluorescence of freshly excised tissues and/or in formalin-fixed 
cryopreserved tissue sections in four independently injected muscles harvested at 0, 
5, 10, 16, 22 and 80 days post-infection. GFP expression was detected at low levels 
beginning at 10 days and was maximum at 22 days post-infection. Expression 
remained stable to 80 days at which time greater than 50% of the tissue was positive 
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(see 80 day tissue cross section counter stained with propidium iodide, panel A). 
Hirt DNA was isolated from muscle samples at each of the various time points and 
after points was used to transform E. coli. Rescued plasmids (p439, pi 6, pi 7) were 
analyzed by Southern blotting in Panel B showing an agarose gel on left and ITR 
5 probed blot on right. U:uncut, P:PstI cut, and S:SphI cut. The schematic drawing of 
the most predominant type of head-to-tail circular AAV intermediate plasmids 
rescued from bacteria is given in the right of Panel B and shows the structure of pi 7 
as an example. Other typical clones included those with less than two ITRs as 
shown for pi 6. SphI digestion of pi 6 and pi 7 plasmids released ITR hybridizing 
10 fragments of approximately 140 and 300 bp, respectively. The slightly lower 
mobility then predicted for these ITR fragments likely represents anomalous 
migration due to the high secondary structure of inverted repeats within ITRs. 

yp Sequence analysis of pl7 and pl6 using nested primers to 5' and 3'-ITRs also 

OR 

gj confirmed the ITR orientations shown to the right of the gel. Additional restriction 

15 enzyme analyses to determine this structure included double and single digests with 
yn SphI, PstI, Asel, and/or Smal. An example of an atypical clone (p439) rescued 

y from bacteria with unknown structure is also shown. 

H Figure 5. Frequency of circular intermediate formation in muscle following 

o 

p transduction with rAAV. Hirt DNAs isolated from rAAV infected tibialis muscle 

^ 20 were used to transform E. coli and the rescued plasmids analyzed by Southern 

O blotting (greater than 20 clones were analyzed from at least two independent muscle 

samples for each time point). The averages of total head-to-tail circular intermediate 
clones (line) and ampicillin resistant bacterial clones (bar) isolated from each tibialis 
anterior muscle at 0, 5, 10, 16, 22 and 80 days post-infection are summarized in 
25 Panel A. Only plasmids which contained 1-2 ITRs were included in the estimation 
of total head-to-tail circular intermediates. Plasmids which demonstrated an 
absence of ITR hybridizing SphI fragments (between 150 to 300 bp) were omitted 
from the calculations. Panel B demonstrates the diversity of ITR arrays found in 
head-to-tail circular intermediates at 80 days post-infection. This panel depicts a 
30 Southern blot probed with ITR sequences and represents circular intermediates with 



15 



1-3 ITRs. SphI fragments which hybridize to ITR probes indicate the size of 
inverted ITR arrays (marked by arrows to right of gel). Additional restriction 
enzyme analysis was used to determine the structure of monomer and multimer 
circular intermediates. Examples are shown for two multimer (pi 36 and pi 43) 
circular intermediates which contain approximately three AAV genomes. 
Undigested plasmids of pi 36 and pi 43 migrate greater than 12 kb and is contrasted 
to the most predominant form of head-to-tail undigested circular intermediates at 22 
days which migrate at 2.5 kb. The digestion pattern of pl36 is consistent with a 
uniform head-to-tail configuration of three genomes which is indistinguishable from 
digestion patterns of pi 39 which contains one circularized genome (undigested pi 39 
migrates at 2.5 kb, data not shown, also see examples pl7 in Figure 4). In contrast, 
pi 36 depicts a more complex head-to-tail multimer circular intermediate which has 
various deletions and duplications within the ITR arrays. Predicted structure of five 
representative intermediates is schematically shown in Panel C. 

Figure 6. Molecular size of circular intermediates in muscle. HirtDNA 
from AV.GFP3ori infected muscle was size fractionated by electrophoresis and 
various molecular weight fractions transformed into E. coli. Results demonstrate 
the abundance of circular intermediates at each of the given molecular weights at 22 
and 80 days post-infection with the rAAV shuttle vector. Structure of circular 
intermediates were confirmed by Southern blot restriction analysis. 

Figure 7. Head-to-tail circular intermediates demonstrate increased stability 
of GFP expression following transient transfection in Hela cells. Subconfluent 
monolayers of Hela cells were co-transfected with p81, p87, or pCMVGFP and 
pRSVlacZ as an internal control for transfection efficiency as described in the 
methods. Panel A demonstrates the expansion of GFP clones after one passage 
(arrows). Quantification of clone size and numbers are shown in Panel B. Clone 
size represents the mean raw values while clone numbers are normalized for 
transfection efficiency as determined by X-gal staining for pRSVlacZ. The data at 
the top of bar graph values for each construct in Panel B represents quantification of 
GFP clones after second passage (also normalized for transfection efficiency). 
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Results indicate the mean (+/-SEM) of duplicate experiments with greater than 20 
fields quantified for each experimental point. The persistence of transfected p81 and 
pCMVGFP plasmid DNA at passage-7 post-transfection was evaluated by genomic 
Southern blot of total cellular DNA hybridized against 32 P-labeled GFP probe (Panel 
C, results from two independent transfections are shown). U:uncut, C:PstI cut. The 
migration of uncut dimer and monomer plasmids forms are marked on the left. PstI 
digestion of the plasmids results in bands at 4.7 kb (pCMVGFP, single PstI site in 
plasmid) and 1.7 kb (p81, two PstI sites flanking the GFP gene). To determine 
whether the head-to-tail ITR array within circular intermediates was responsible for 
increases in the persistence of GFP expression, the head-to-tail ITR DNA element 
was subcloned into the pGL3 luciferase plasmid to generate pGL3(ITR). Results in 
Panel D compare the extent of luciferase transgene expression following 
transfection with pGL3 and pGL3(ITR) at 10 days (passage-2) post-transfection. 
Results are the mean (+/-SEM) for triplicate experiments and are normalized for 
transfection efficiency using a dual renilla luciferase reporter vector (pRLSV40, 
Promega). 

Figure 8. Identification of adenoviral genes responsible for augmentation of 
AAV circular intermediate formation. Hela cells were infected with AV.GFP3ori 
(1000 DNA particles/cell) in the presence of wtAd5, c//802 (E2a-deleted), and 
dl 1004 (E4-deleted) adenovirus (at the indicated MOIs). Total number of head-to- 
tail circular intermediates from Hirt DNA and the level of augmentation of GFP 
transgene expression (as determined by FACS) was quantified at 24 hours post- 
infection (Panel A). Results are the average of duplicate experiments. Panel B 
depicts results from Southern blot analysis of Hirt DNA following hybridization to a 
GFP P 32 -labeled probe. DNA loads were 10% of the total Hirt yield from a 35 mm 
plate of Hela cells. Infections were carried out identically to that described for 
Panel A. Arrows mark replication form concatamers (Rf c ), dimers (RQ, monomers 
(Rf m ), and single-stranded AAV genomes (ssDNA). 

Figure 9. Model for independent mechanistic interactions of adenovirus 
with lytic and latent phase aspects of the AAV life cycle. The adenoviral E4 gene 



17 



has been shown to augment the level of r AAV second strand synthesis giving rise to 
replication form dimers (RQ and monomers (Rf m ) (Figure 8B). This augmentation 
leads to substantial increases in transgene expression from rAAV vectors and most 
closely mirrors lytic phase replication of wtAAV as head-to-head and tail-to-tail 
5 concatamers. In contrast, E4 expression inhibits the formation of head-to-tail 

circular intermediates of AAV. Hence, it appears that increases in the amount of Rf d 
and Rf m double stranded DNA genomes does not increase the extent of circular 
intermediate formation. Such findings suggest that conversion of Rf m and Rf d to 
circular intermediates does not likely occur and implicates two mechanistically 
10 distinct pathway for their formation. In support of this hypothesis, adenoviral E2a 
gene expression does not enhance the formation of Rf m and Rf d genomes but rather 

Q increase the abundance and/or stability of head- to-tail circular intermediates. 

y3 

« Furthermore, in the absence of E4, E2a gene expression does not lead to 

C§ augmentation of rAAV transgene expression. Since circular intermediates have 

yj 15 increased episomal stability in muscle and in Hela cells, this molecular structure 

may be important in the latent phase of AAV persistence. Alternatively, these 
circular intermediates may represent pre-integration complexes as previously 
hypothesized for Rep facilitated integration. In the absence of Rep, circular 
intermediates may accumulate episomally in rAAV infected cells. In summary, 
20 these findings support the notion that adenovirus may modulate both latent and lytic 
aspects of the AAV life cycle. 

Figure 10. Individual chemical sequence of SphI fragments from p81 (A; 
SEQ ID NO:4), p79 (B; SEQ ID NO:5), and pl202 (C; SEQ ID NO:6) AAV 
circular intermediates. The ends of the sequence (underlined) represent SphI 
25 restriction enzyme sites within head-to-tail circular AAV genomes cloned with the 
AV-GFP3ori shuttle virus. 

Figure 11. Chemical sequence homology of three AAV circular 
intermediates with various conformations of ITR arrays (SEQ ID NO:4, SEQ ID 
NO: 5 and SEQ ID NO: 6). Diversity in ITR arrays are evident from the non- 
30 conserved bases marked in lower case. The ends of the sequence (underlined) 
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represent SphI restriction enzyme sites within head-to-tail circular AAV genomes 
cloned with the AV.GFP3ori shuttle virus. 

Figure 12 A. Palindromic repeat structure derived from chemical sequencing 
of AAV circular intermediate isolate p81. Secondary structure of the sense strand is 
depicted in the top box with plasmid reference given below. 

Figure 12B. Palindromic repeat structure derived from chemical sequencing 
of AAV circular intermediate isolate p79. Secondary structure of the sense strand is 
depicted in the top box with plasmid reference given below. 

Figure 12C. Palindromic repeat structure derived from chemical sequencing 
of AAV circular intermediate isolate p79. Secondary structure of the sense strand is 
depicted in the top box with plasmid reference given below. 

Figure 13. Persistence of GFP expression in developing Xenopus embryos 
microinjected with AAV circular intermediate isolate p8 1 . The extent of GFP 
fluorescence in tadpoles reflects the stability of episomal or integrated microinjected 
plasmids. Bright field image on the left is of the p81 injected embryo. The p81 
injected embryo depicts fluorescence in nearly all cells by one week post-injection. 
In contrast, a mosaic pattern of expression in a minority of cells in pCisAV.GFPori 
injected embryos. The pCisAV.GFPori plasmid contains the identical promoter 
sequences driving GFP gene expression and two ITRs separated by stuffer sequence. 
These findings demonstrate that specific structural characteristics found within 
AAV circular intermediates are responsible for increased persistence of transgene 
expression. 

Figure 14. Mechanistic scheme for determining pathways for rAAV circular 
concatamer formation. The two independent vectors used in these studies, 
AV.Alkphos and AV.GFP3.ori, are shown in Panel A . Restriction sites important 
in the structural analysis of circular intermediates are also shown. In Panel B, a 
schematic representation of two potential models for circular concatamer formation 
is depicted, along with the methods to experimentally differentiate which of these 
processes is active in muscle. Following co-infection of the tibialis muscle with 
AV.Alkphos and AV.GFP3.ori, all subsequently rescued plasmids arise solely from 
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circular intermediates containing AV.GFP3ori genomes. If rolling circular 
replication is the sole mechanism of concatamerization, only GFP expressing 
plasmids should be rescued. In contrast, if intermolecular recombination between 
independently formed monomer circular intermediates is the mechanism of 
concatamerization, both GFP and GFP/Alkphos expressing plasmids should be 
rescued. 

Figure 15. Co-infection of tibialis muscle of mice with AV.Alkphos and 
AV.GFP3ori. Transgene expression of rAAV infected tibialis muscle was 
determined at 14, 35, 80 (Panels A and A'), and 120 (Panels B-D) days following 
co-infection with 5 x 10 9 DNA particles each of AV.Alkphos and AV.GFP3ori. The 
time course of transgene expression started around 14 days and peaked by 35-80 
days. The extent of co-infection of myofibers with both Alkphos and GFP rAAV 
was determined in serial sections of 80 and 120 day post-infection muscle samples. 
Panels A-C represent GFP fluorescence of formalin fixed, cryoprotected sections, 
while panels A'-C depict the histochemical staining for Alkaline phosphatase in 
adjacent serial sections. A short staining time (7 minutes) was necessary to observe 
variation in staining levels for comparison to GFP. It was found that longer staining 
times (30 minutes) saturated the Alkphos signal. The boxed region in panels B and 
B' are enlarged in panels C and C', respectively. A more precise correlation of GFP 
and Alkphos staining in myofibers is given in Panel D in which co-localization of 
GFP and Alkphos expression was examined in the same section of a 120 day post- 
infected sample. This was performed by photographing the GFP fluorescent image 
prior to staining for Alkphos activity. The left panel of D shows a high power 
Nomarski photomicrograph of a group of myofibers (traced in red) , while the 
corresponding GFP and Alkphos staining patterns are shown in the right panel. 
Photomicrographs of Alkphos staining were taken with a red filter to allow for 
superimposition of staining patterns with GFP fluorescence. Co-expression of 
Alkphos and GFP is shown within myofibers as a yellow/orange color. Myofibers 
are marked as follows: (-) negative for both Alkphos and GFP, (*) positive for only 
GFP, and (+) positive for both GFP and Alkphos. 
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Figure 16. Rescue of circular intermediates and characterization of DNA 
hybridization patterns. Using the ampicillin resistance gene (amp) and bacterial ori 
incorporated into the AV.GFP3ori vector, the extent of circular intermediate 
formation was assessed by rescuing amp resistant plasmids following transformation 
of 1/5 the isolated Hirt DNA into E. coli Sure cells. Twenty plasmids from each 
muscle sample were prepared and analyzed by slot blot hybridization against GFP, 
Alkphos, and Amp 32 P-labeled DNA probes. A representative group demonstrating 
the hybridization patterns is shown in Panel A. Panel B depicts the mean (+/-SEM) 
number of rescued bacterial plasmids that hybridized to either GPF alone, or to both 
GFP and Alkphos probes, following transformation of l/5 th of the Hirt DNA. These 
numbers were calculated from the percentage of plasmids hybridizing to GPF and/or 
Alkphos and the total CFU plating efficiency derived from the original 
transformation. In total, 3 independent muscle samples were analyzed for a total of 
60 plasmids at each time point. The percentage of GFP hybridization positive 
rescued plasmids that also demonstrated hybridization to Alkphos is shown in 
Panel C. These data demonstrate an increase in the abundance of rescued 
GFP/Alkphos co-encoding circular intermediates over time. 

Figure 17. Transgene expression from rescued circular intermediates. 
Rescued circular intermediate plasmids were transfected into 293 cells for 
assessment of their ability to express encoded transgenes. In these studies all GFP 
hybridization positive clones from at least two muscles were tested for each time 
point and scored for their ability to express GFP and Alkaline phosphatase. In total 
at least 40 clones were evaluated for each time point. Three patterns of transgene 
expression were observed following transfection of these plasmids: I) no gene 
expression (Panel A), II) GFP expression only (Panel B), and III) GFP and Alkphos 
expression (Panel C). Panels A-C depict Nomarski photomicrographs (left) of GFP 
fluorescent fields (center) and Alkphos staining of a different field from the same 
culture (right). The percentage of GFP hybridization positive clones that also 
expressed GFP is shown in Panel D. Additionally, this panel illustrates the 
percentage of GFP expressing clones also expressing Alkphos. 
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Figure 18. Structural analysis of bi-functional concatamer circular 
intermediates. To fully characterize the nature of GFP and Alkphos co-expressing 
circular intermediates, detailed structural analyses were performed using restriction 
enzyme mapping and Southern blot hybridization with GFP, Alkphos, and ITR 32 P- 
labeled probes. Results from Southern blot analysis of plasmid clone #33 (Panel A) 
and clone #5 (Panel C) are given as representative examples of circular 
intermediates isolated from 80 and 35 day Hirt DNA of rAAV infected muscle, 
respectively. Agarose gels were run in triplicate for each of these clones and 
Southern blot filters were hybridized with one of the three DNA probes as indicated 
below each autoradiogram. Molecular weights (kb) are indicated to the left of the 
ethidium stained agarose gel and restriction enzymes are marked on the top of each 
gel/filter. Panels B and D give the deduced structure of plasmid clones #33 and #5, 
respectively, as based on Southern blot analysis. For ease of comparison with the 
restriction maps of the viral genomes given in Figure 14A, the position of restriction 
enzyme sites (kb) are marked with the indicated orientation of intact viral genomes. 
However, in clone #33 a deletion occurred between the Asel and Hindlll site of a 
head-to-tail array between AV.Alkphos and AV.GFP3ori, as reflected by a 900 bp 
reduction in the anticipated size of Hindlll/NotI and Clal/Asel fragments (marked 
by asterisks in Panel A). Furthermore, the SphI site flanking an ITR was ablated in 
clone #5 (bands effected by this deletion are marked by asterisks in Panel C). The 
deletion is not reflected in the overall concatamer since the exact region involved 
and/or the size of the deletion is unclear. Additionally, chemical sequence evidence 
of rescued circular intermediates suggests that the predominant form of ITR arrays 
may be in a double-D structure (i.e., one ITR flanked by two D-sequence rather than 
two ITRs) and hence ITR arrays containing fragments may appear 147 bp shorter 
than indicated. However, to more easily depict the orientation of viral genomes, the 
position of 5'and 3' ITRs is indicated rather than representing a single ITR at these 
junctions. 

Figure 19. Application of rAAV circular concatamers to deliver trans- 
splicing vectors with large gene inserts. Panel A depicts two rAAV vectors 
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encoding two halves of a cDNA (red) and flanked by splice site consensus 
sequences (brown). Panel B depicts one potential type of intermolecular concatamer 
following co-infection of cells with the independent vectors shown in panel A. Full 
length transgene mRNA can then be produced by splicing. Panel C depicts two 
rAAV vectors encoding two halves of a CFTR DNA flanked by a promoter and 
splice donor or a splice acceptor and a poly A sequence, respectively. Panel D 
shows one potential type of intermolecular concatamer following co-infection of 
cells with the vectors in Panel C. 

Figure 20. Schematic representation of the rAAV vectors used for cis- 
activation. 

Figure 21. Strategy for enhancing rAAV gene expression through 
intermolecular c/s-activation. Two independent rAAV viruses, one encoding a 
transgene with or without a minimal promoter (e.g., AV.SV(P)Luc) and another 
harboring enhancer sequences (e.g., AV.SupEnh), were used to co-infect the same 
tissue. Subsequent concatamerization between two rAAV vectors substantially 
augments expression of the transgene, due to the presence of enhancer elements 
within the same circularized molecule. 

Figure 22. Intermolecular c/s-activation increases rAAV mediated gene 
transfer in fibroblasts. Human fibroblast cells were infected with the indicated 
rAAV vector(s) at an moi of 1000 for each individual vector. Luciferase activity 
was examined at 3 days post-infection. The data represent the mean +/- SEM of 6 
independent samples for each experimental condition. 

Figure 23. Intermolecular cis-activation increases rAAV mediated gene 
transfer to muscle in vivo. Mouse tibialis anterior muscles were infected with the 
indicated rAAV vector(s) at 2 x 10 10 particles per viral vector in a total volume of 30 
jxl PBS. The luciferase activity in rAAV infected or mock infected (PBS) muscles 
was examined at 30 days (Panel A) and 90 days (Panel B) post-infection. The data 
represent the mean +/- SEM of 6 independent muscle samples for each 
experimental condition. Co-administration of the AV.SupEnh vector harboring 
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enhancer elements substantially enhanced rAAV mediated luciferase expression in 
muscle from both the ITR and the minimal SV40 promoter. 

Figure 24. Viral constructs for the generation of autonomously replicating 
rAAV vectors as circular concatamers. Panel A depicts two rAAV constructs used 
to test this hypothesis. One encodes the GFP transgene (green) and the other 
encodes the EBNA-1 (red) and OriP (purple) sequences necessary for autonomous 
replication. Additionally, sequences encoded within the GFP vector allow for 
rescue of circular intermediates in bacteria. Panel B depicts one potential type of 
intermolecular concatamer following co-infection of the independent vectors shown 
in panel A. 

Figure 25. rAAV vectors used to generate a trans -splicing vector 
expressing genomic epo DNA. Panel A shows a schematic of the vectors. An IRES 
sequence and EGFP gene are included in one of the vectors to allow for direct 
visualization of transgene expression. Panel B depicts a potential circular 
concatamer formed after co-infection. The dashed lines indicate the splicing 
pattern. Panel C shows the hnRNA, splicing pattern and mature mRNA transcripts 
which result from circular concatamerization of the two vectors. 

Figure 26. Production of full length Epo protein following co-infection of 
primary fibroblasts with two independent trans-splicing vectors. Confluent primary 
fibroblasts (8 x 10 5 cells) were infected with 7 x 10 9 particles of each AV.Epol 
and/or AV.Epo2. Epo expression was monitored by ELISA (R&D Systems, 
Minneapolis, MN) by harvesting culture media 24 hours following media 
replacement at the indicated time points. Results are presented as the mean Epo 
level (n = 3) normalized to 1 x 10 5 cells. 

Figure 27. Functional expression of human Epo in vivo using trans-splicing 
AAV vectors. Hematocrits of the C57BL/6 mice were determined at 10, 21, 35, 48, 
63, 80, and 94 days following infection with 3 x 10 10 particles of each independent 
vector either together in one tibialis muscle (n = 6, denoted as solid squares) or 
independently by administration of AV.Epol to the right tibialis muscle and 
AV.Epo2 to the left tibialis muscle of the same mouse (n = 4, denoted by solid 
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triangles). As a control for baseline, the serum of uninfected mice (n = 4, denoted 
by open circles) was assayed. 

Detailed Description of the Invention 

Definitions 

As used herein, the terms "isolated and/or purified" refer to in vitro 
preparation, isolation and/or purification of a nucleic acid molecule of the invention, 
so that it is not associated with in vivo substances. 

As used herein, a DNA molecule, sequence or segment of the invention 
preferably is biologically active. A biologically active DNA molecule of the 
invention has at least about 1%, more preferably at least about 10%, and more 
preferably at least about 50%, of the activity of a DNA molecule comprising ITR 
sequences from a circular intermediate of AAV, e.g., a DNA molecule comprising 
SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, or a subunit or variant 
thereof. The activity of a nucleic acid molecule of the invention can be measured by 
methods well known to the art, some of which are described hereinbelow. For 
example, the presence of the DNA molecule in a recombinant nucleic acid molecule 
in a host cell results in episomal persistence and/or increased abundance of the 
recombinant molecule in those cells relative to corresponding cells having a 
recombinant nucleic acid molecule lacking a DNA molecule of the invention. 

A variant DNA molecule, sequence or segment of the invention has at least 
about 70%, preferably at least about 80%, and more preferably at least about 90%, 
but less than 100%, contiguous nucleotide sequence homology or identity to a DNA 
molecule comprising ITR sequences from a circular intermediate of AAV, e.g., SEQ 
ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, a subunit thereof. A variant 
DNA molecule of the invention may include nucleotide bases not present in SEQ ID 
NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, e.g., 5', 3' or internal deletions 
or insertions, such as the insertion of a restriction endonuclease recognition site, so 
long as these bases do not substantially reduce the biological activity of the 
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molecule. A substantial reduction in activity means a reduction in activity of greater 
than about 50%, preferably greater than about 90%. 

I. Identification of Nucleic Acid Molecules Falling Within the Scope of the 
Invention 

A. Nucleic Acid Molecules of the Invention 

1 . Sources of the Nucleic Acid Molecules of the Invention 

Sources of nucleotide sequences from which the present nucleic acid 
molecules can be obtained include AAV infected cells, e.g., any vertebrate, 
preferably mammalian, cellular source. 

As used herein, the terms "isolated and/or purified" refer to in vitro isolation 
of a nucleic acid, e.g., DNA molecule from its natural cellular environment, and 
from association with other components of the cell, such as nucleic acid or 
polypeptide, so that it can be sequenced, replicated, and/or expressed. For example, 
"isolated nucleic acid" is RNA or DNA containing greater than about 50, preferably 
about 300, and more preferably about 500 or more, sequential nucleotide bases that 
comprise a DNA segment from a circular intermediate of AAV which contains at 
least a portion of the 5' and 3' ITRs and the D sequence, or a variant thereof, that is 
complementary or hybridizes, respectively, to AAV ITR DNA and remains stably 
bound under stringent conditions, as defined by methods well known in the art, e.g., 
in Sambrook et al., 1989. Thus, the RNA or DNA is "isolated" in that it is free from 
at least one contaminating nucleic acid with which it is normally associated in the 
natural source of the RNA or DNA and is preferably substantially free of any other 
mammalian RNA or DNA. The phrase "free from at least one contaminating source 
nucleic acid with which it is normally associated" includes the case where the 
nucleic acid is reintroduced into the source or natural cell but is in a different 
chromosomal location or is otherwise flanked by nucleic acid sequences not 
normally found in the source cell, e.g., in a vector or plasmid. An example of 
isolated nucleic acid within the scope of the invention is nucleic acid that shares at 
least about 80%, preferably at least about 90%, and more preferably at least about 
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95%, sequence identity with SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5 or SEQ 
ID NO:6, or a subunit thereof. 

As used herein, the term "recombinant nucleic acid" or "preselected nucleic 
acid," e.g., "recombinant DNA sequence or segment" or "preselected DNA 
5 sequence or segment" refers to a nucleic acid, e.g., to DNA, that has been derived or 
isolated from any appropriate cellular source, that may be subsequently chemically 
altered in vitro, so that its sequence is not naturally occurring, or corresponds to 
naturally occurring sequences that are not positioned as they would be positioned in 
a genome which has not been transformed with exogenous DNA. An example of 
10 preselected DNA "derived" from a source, would be a DNA sequence that is 
identified as a useful fragment within a given organism, and which is then 
chemically synthesized in essentially pure form. An example of such DNA 
"isolated" from a source would be a useful DNA sequence that is excised or 
00 removed from said source by chemical means, e.g., by the use of restriction 

yi 15 endonucleases, so that it can be further manipulated, e.g., amplified, for use in the 

kl invention, by the methodology of genetic engineering. 

a Thus, recovery or isolation of a given fragment of DNA from a restriction 

q digest can employ separation of the digest on polyacrylamide or agarose gel by 

electrophoresis, identification of the fragment of interest by comparison of its 
p 20 mobility versus that of marker DNA fragments of known molecular weight, removal 

^ of the gel section containing the desired fragment, and separation of the gel from 

DNA. See Lawn et al., Nucleic Acids Res. . 9, 6103 (1981), and Goeddel et al., 
Nucleic Acids Res. . 8, 4057 (1980). Therefore, "preselected DNA" includes 
completely synthetic DNA sequences, semi-synthetic DNA sequences, DNA 
25 sequences isolated from biological sources, and DNA sequences derived from RNA, 
as well as mixtures thereof. 

Nucleic acid molecules having base pair substitutions (i.e., variants) are 
prepared by a variety of methods known in the art. These methods include, but are 
not limited to, isolation from a natural source (in the case of naturally occurring 
30 sequence variants) or preparation by oligonucleo tide-mediated (or site-directed) 
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mutagenesis, PCR mutagenesis, and cassette mutagenesis of an earlier prepared 
variant or a non-variant version of the nucleic acid molecule. 

Oligonucleotide-mediated mutagenesis is a preferred method for preparing 
substitution variants. This technique is well known in the art as described by 
5 Adelman et al., DNA, 2, 183 (1983). Briefly, AAV DNA is altered by hybridizing 
an oligonucleotide encoding the desired mutation to a DNA template, where the 
template is the single-stranded form of a plasmid or bacteriophage containing the 
unaltered or native DNA sequence of AAV. After hybridization, a DNA 
polymerase is used to synthesize an entire second complementary strand of the 

10 template that will thus incorporate the oligonucleotide primer, and will code for the 
selected alteration in the AAV DNA. 

Generally, oligonucleotides of at least 25 nucleotides in length are used. An 
optimal oligonucleotide will have 12 to 15 nucleotides that are completely 
complementary to the template on either side of the nucleotide(s) coding for the 

15 mutation. This ensures that the oligonucleotide will hybridize properly to the 
single-stranded DNA template molecule. The oligonucleotides are readily 
synthesized using techniques known in the art such as that described by Crea et al., 
Proc. Natl. Acad. Sci. U.S.A. , 75, 5765 (1978). 

The DNA template can be generated by those vectors that are either derived 

20 from bacteriophage Ml 3 vectors (the commercially available M13mpl8 and 

M13mpl9 vectors are suitable), or those vectors that contain a single- stranded phage 
origin of replication as described by Viera et al., Meth. EnzvmoL 153 , 3 (1987). 
Thus, the DNA that is to be mutated may be inserted into one of these vectors to 
generate single-stranded template. Production of the single-stranded template is 

25 described in Sections 4.21-4.41 of Sambrook et al, Molecular Cloning: A 
Laboratory Manual (Cold Spring Harbor Laboratory Press, N.Y. 1989). 

Alternatively, single-stranded DNA template may be generated by 
denaturing double-stranded plasmid (or other) DNA using standard techniques. 

For alteration of the native DNA sequence (to generate amino acid sequence 

30 variants, for example), the oligonucleotide is hybridized to the single-stranded 
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template under suitable hybridization conditions. A DNA polymerizing enzyme, 
usually the Klenow fragment of DNA polymerase I, is then added to synthesize the 
complementary strand of the template using the oligonucleotide as a primer for 
synthesis. A heteroduplex molecule is thus formed such that one strand of DNA 
encodes the mutated form of AAV, and the other strand (the original template) 
encodes the native, unaltered sequence of AAV. This heteroduplex molecule is then 
transformed into a suitable host cell, usually a prokaryote such as E. coli JM101. 
After the cells are grown, they are plated onto agarose plates and screened using the 
oligonucleotide primer radiolabeled with 32-phosphate to identify the bacterial 
colonies that contain the mutated DNA. The mutated region is then removed and 
placed in an appropriate vector, generally an expression vector of the type typically 
employed for transformation of an appropriate host. 

The method described immediately above may be modified such that a 
homoduplex molecule is created wherein both strands of the plasmid contain the 
mutations(s). The modifications are as follows: The single-stranded 
oligonucleotide is annealed to the single-stranded template as described above. A 
mixture of three deoxyribonucleotides, deoxyriboadenosine (dATP),' 
deoxyriboguanosine (dGTP), and deoxyribothymidine (dTTP), is combined with a 
modified thiodeoxyribocytosine called dCTP-(aS) (which can be obtained from the 
Amersham Corporation). This mixture is added to the template-oligonucleotide 
complex. Upon addition of DNA polymerase to this mixture, a strand of DNA 
identical to the template except for the mutated bases is generated. In addition, this 
new strand of DNA will contain dCTP-(aS) instead of dCTP, which serves to 
protect it from restriction endonuclease digestion. 

After the template strand of the double-stranded heteroduplex is nicked with 
an appropriate restriction enzyme, the template strand can be digested with ExoIII 
nuclease or another appropriate nuclease past the region that contains the site(s) to 
be mutagenized. The reaction is then stopped to leave a molecule that is only 
partially single-stranded. A complete double-stranded DNA homoduplex is then 
formed using DNA polymerase in the presence of all four deoxyribonucleotide 
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triphosphates, ATP, and DNA ligase. This homoduplex molecule can then be 
transformed into a suitable host cell such as E. coli JM1 01 . 

For example, a preferred embodiment of the invention is an isolated and 
purified DNA molecule comprising a DNA segment comprising SEQ ID NO:3, 
SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, a subunit thereof or a variant thereof 
having nucleotide substitutions, or deletions or insertions. 
II, Preparation of Molecules Useful to Practice the Methods of the Invention 
A. Nucleic Acid Molecules 
1 . Chimeric Expression Cassettes 

To prepare expression cassettes for transformation herein, the recombinant 
or preselected DNA sequence or segment may be circular or linear, double-stranded 
or single-stranded. Generally, the preselected DNA sequence or segment is in the 
form of chimeric DNA, such as plasmid DNA, that can also contain coding regions 
flanked by control sequences which promote the expression of the preselected DNA 
present in the resultant cell line. 

As used herein, "chimeric" means that a vector comprises DNA from at least 
two different species, or comprises DNA from the same species, which is linked or 
associated in a manner which does not occur in the "native" or wild type of the 
species. 

Aside from the preselected DNA sequences described above, a portion of the 
preselected DNA may serve a regulatory or a structural function. For example, the 
preselected DNA may itself comprise a promoter that is active in mammalian cells, 
or may utilize a promoter already present in the genome that is the transformation 
target. Such promoters include the CMV promoter, as well as the S V40 late 
promoter and retroviral LTRs (long terminal repeat elements), although many other 
promoter elements well known to the art may be employed in the practice of the 
invention. 

Other elements functional in the host cells, such as introns, enhancers, 
polyadenylation sequences and the like, may also be a part of the preselected DNA. 
Such elements may or may not be necessary for the function of the DNA, but may 
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provide improved expression of the DNA by affecting transcription, stability of the 
mRNA, or the like. Such elements may be included in the DNA as desired to obtain 
the optimal performance of the transforming DNA in the cell. 

"Control sequences" is defined to mean DNA sequences necessary for the 
expression of an operably linked coding sequence in a particular host organism. The 
control sequences that are suitable for prokaryotic cells, for example, include a 
promoter, and optionally an operator sequence, and a ribosome binding site. 
Eukaryotic cells are known to utilize promoters, polyadenylation signals, and 
enhancers. 

"Operably linked" is defined to mean that the nucleic acids are placed in a 
functional relationship with another nucleic acid sequence. For example, DNA for a 
presequence or secretory leader is operably linked to DNA for a peptide or 
polypeptide if it is expressed as a preprotein that participates in the secretion of the 
peptide or polypeptide; a promoter or enhancer is operably linked to a coding 
sequence if it affects the transcription of the sequence; or a ribosome binding site is 
operably linked to a coding sequence if it is positioned so as to facilitate translation. 
Generally, "operably linked" means that the DNA sequences being linked are 
contiguous and, in the case of a secretory leader, contiguous and in reading phase. 
However, enhancers do not have to be contiguous. Linking is accomplished by 
ligation at convenient restriction sites. If such sites do not exist, the synthetic 
oligonucleotide adaptors or linkers are used in accord with conventional practice. 

The preselected DNA to be introduced into the cells further will generally 
contain either a selectable marker gene or a reporter gene or both to facilitate 
identification and selection of transformed cells from the population of cells sought 
to be transformed. Alternatively, the selectable marker may be carried on a separate 
piece of DNA and used in a co-transformation procedure. Both selectable markers 
and reporter genes may be flanked with appropriate regulatory sequences to enable 
expression in the host cells. Useful selectable markers are well known in the art and 
include, for example, antibiotic and herbicide-resistance genes, such as neo, hpt, 
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dhfr, bar, aroA, dapA and the like. See also, the genes listed on Table 1 of 
Lundquist et al. (U.S. Patent No. 5,848,956). 

Reporter genes are used for identifying potentially transformed cells and for 
evaluating the functionality of regulatory sequences. Reporter genes which encode 
for easily assayable proteins are well known in the art. In general, a reporter gene is 
a gene which is not present in or expressed by the recipient organism or tissue and 
which encodes a protein whose expression is manifested by some easily detectable 
property, e.g., enzymatic activity. Preferred genes include the chloramphenicol 
acetyl transferase gene (cat) from Tn9 of E. coli, the beta-glucuronidase gene (gus) 
of the uidA locus of E. coli, and the luciferase gene from firefly Photinus pyralis. 
Expression of the reporter gene is assayed at a suitable time after the DNA has been 
introduced into the recipient cells. 

The general methods for constructing recombinant DNA which can 
transform target cells are well known to those skilled in the art, and the same 
compositions and methods of construction may be utilized to produce the DNA 
useful herein. For example, J. Sambrook et al., Molecular Cloning: A Laboratory 
Manual Cold Spring Harbor Laboratory Press (2d ed., 1989), provides suitable 
methods of construction. 
2. Transformation into Host Cells 

The recombinant DNA can be readily introduced into the host cells, e.g., 
mammalian, bacterial, yeast or insect cells by transfection with an expression vector 
of the invention, by any procedure useful for the introduction into a particular cell, 
e.g., physical or biological methods, to yield a transformed cell having the 
recombinant DNA stably integrated into its genome or present as an episome which 
can persist in the transformed cells, so that the DNA molecules, sequences, or 
segments, of the present invention are maintained and/or expressed by the host cell. 

Physical methods to introduce a preselected DNA into a host cell include 
calcium phosphate precipitation, lipofection, particle bombardment, microinjection, 
electroporation, and the like. Biological methods to introduce the DNA of interest 
into a host cell include the use of DNA and RNA viral vectors. The main advantage 
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of physical methods is that they are not associated with pathological or oncogenic 
processes of viruses. However, they are less precise, often resulting in multiple 
copy insertions, random integration, disruption of foreign and endogenous gene 
sequences, and unpredictable expression. 

As used herein, the term "cell line" or "host cell" is intended to refer to well- 
characterized homogenous, biologically pure populations of cells. These cells may 
be eukaryotic cells that are neoplastic or which have been "immortalized" in vitro by 
methods known in the art, as well as primary cells, or prokaryotic cells. The cell line 
or host cell is preferably of mammalian origin, but cell lines or host cells of non- 
mammalian origin may be employed, including plant, insect, yeast, fungal or 
bacterial sources. Generally, the preselected DNA sequence is related to a DNA 
sequence which is resident in the genome of the host cell but is not expressed, or not 
highly expressed, or, alternatively, overexpressed. 

"Transfected" or "transformed" is used herein to include any host cell or cell 
line, the genome of which has been altered or augmented by the presence of at least 
one preselected DNA sequence, which DNA is also referred to in the art of genetic 
engineering as "heterologous DNA," "recombinant DNA," "exogenous DNA," 
"genetically engineered," "non-native," or "foreign DNA," wherein said DNA was 
isolated and introduced into the genome of the host cell or cell line by the process of 
genetic engineering. The host cells of the present invention are typically produced 
by transfection with a DNA sequence in a plasmid expression vector, a viral 
expression vector, or as an isolated linear DNA sequence. 

To confirm the presence of the preselected DNA sequence in the host cell, a 
variety of assays may be performed. Such assays include, for example, "molecular 
biological" assays well known to those of skill in the art, such as Southern and 
Northern blotting, RT-PCR and PCR; "biochemical" assays, such as detecting the 
presence of a polypeptide expressed from a gene present in the vector, e.g., by 
immunological means (immunoprecipitations, immunoaffinity columns, ELISAs 
and Western blots) or by any other assay useful to identify molecules falling within 
the scope of the invention. 
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To detect and quantitate RNA produced from introduced DNA segments, 
RT-PCR may be employed. In this application of PCR, it is first necessary to 
reverse transcribe RNA into DNA, using enzymes such as reverse transcriptase, and 
then through the use of conventional PCR techniques amplify the DNA. In most 
instances PCR techniques, while useful, will not demonstrate integrity of the RNA 
product. Further information about the nature of the RNA product may be obtained 
by Northern blotting. This technique demonstrates the presence of an RNA species 
and gives information about the integrity of that RNA. The presence or absence of 
an RNA species can also be determined using dot or slot blot Northern 
hybridizations. These techniques are modifications of Northern blotting and only 
demonstrate the presence or absence of an RNA species. 

While Southern blotting and PCR may be used to detect the DNA segment in 
question, they do not provide information as to whether the DNA segment is being 
expressed. Expression may be evaluated by specifically identifying the polypeptide 
products of the introduced DNA sequences or evaluating the phenotypic changes 
brought about by the expression of the introduced DNA segment in the host cell. 
III. Dosages, Formulations and Routes of Administration 

Administration of a nucleic acid molecule may be accomplished through the 
introduction of cells transformed with the nucleic acid molecule (see, for example, 
WO 93/02556), the administration of the nucleic acid molecule itself (see, for 
example, Feigner et aL, U.S. Patent No. 5,580,859, Pardoll et al., Immunity , 3, 165 
(1995); Stevenson et al., Immunol. Rev. , 145 , 211 (1995); Moiling, J. Mol. Med. , 
75, 242 (1997); Donnelly et al., Ann. N.Y. Acad. Sci. , 772, 40 (1995); Yang et al., 
Mol. Med. Today . 2, 476 (1996); Abdallah et al., Biol. Cell , 85, 1 (1995)), through 
infection with a recombinant virus or via liposomes. Pharmaceutical formulations, 
dosages and routes of administration for nucleic acids are generally disclosed, for 
example, in Feigner et al., supra. 

Administration of the therapeutic agents in accordance with the present 
invention may be continuous or intermittent, depending, for example, upon the 
recipients physiological condition, whether the purpose of the administration is 
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therapeutic or prophylactic, and other factors known to skilled practitioners. The 
administration of the agents of the invention may be essentially continuous over a 
preselected period of time or may be in a series of spaced doses. Both local and 
systemic administration is contemplated. When the molecules of the invention are 
employed for prophylactic purposes, agents of the invention are amenable to chronic 
use, preferably by systemic administration. 

One or more suitable unit dosage forms comprising the therapeutic agents of 
the invention, which, as discussed below, may optionally be formulated for 
sustained release, can be administered by a variety of routes including oral, or 
parenteral, including by rectal, transdermal, subcutaneous, intravenous, 
intramuscular, intraperitoneal, intrathoracic, intrapulmonary and intranasal routes. 
The formulations may, where appropriate, be conveniently presented in discrete unit 
dosage forms and may be prepared by any of the methods well known to pharmacy. 
Such methods may include the step of bringing into association the therapeutic 
agent with liquid carriers, solid matrices, semi-solid carriers, finely divided solid 
carriers or combinations thereof, and then, if necessary, introducing or shaping the 
product into the desired delivery system. 

When the therapeutic agents of the invention are prepared for oral 
administration, they are preferably combined with a pharmaceutically acceptable 
carrier, diluent or excipient to form a pharmaceutical formulation, or unit dosage 
form. The total active ingredients in such formulations comprise from 0.1 to 99.9% 
by weight of the formulation. By "pharmaceutically acceptable" it is meant the 
carrier, diluent, excipient, and/or salt must be compatible with the other ingredients 
of the formulation, and not deleterious to the recipient thereof. The active 
ingredient for oral administration may be present as a powder or as granules; as a 
solution, a suspension or an emulsion; or in achievable base such as a synthetic resin 
for ingestion of the active ingredients from a chewing gum. The active ingredient 
may also be presented as a bolus, electuary or paste. 

Pharmaceutical formulations containing the therapeutic agents of the 
invention can be prepared by procedures known in the art using well known and 
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readily available ingredients. For example, the agent can be formulated with 
common excipients, diluents, or carriers, and formed into tablets, capsules, 
suspensions, powders, and the like. Examples of excipients, diluents, and carriers 
that are suitable for such formulations include the following fillers and extenders 
such as starch, sugars, mannitol, and silicic derivatives; binding agents such as 
carboxymethyl cellulose, HPMC and other cellulose derivatives, alginates, gelatin, 
and polyvinylpyrrolidone; moisturizing agents such as glycerol; disintegrating 
agents such as calcium carbonate and sodium bicarbonate; agents for retarding 
dissolution such as paraffin; resorption accelerators such as quaternary ammonium 
compounds; surface active agents such as cetyl alcohol, glycerol monostearate; 
adsorptive carriers such as kaolin and bentonite; and lubricants such as talc, calcium 
and magnesium stearate, and solid polyethyl glycols. 

For example, tablets or caplets containing the agents of the invention can 
include buffering agents such as calcium carbonate, magnesium oxide and 
magnesium carbonate. Caplets and tablets can also include inactive ingredients such 
as cellulose, pregelatinized starch, silicon dioxide, hydroxy propyl methyl cellulose, 
magnesium stearate, microcrystalline cellulose, starch, talc, titanium dioxide, 
benzoic acid, citric acid, corn starch, mineral oil, polypropylene glycol, sodium 
phosphate, and zinc stearate, and the like. Hard or soft gelatin capsules containing 
an agent of the invention can contain inactive ingredients such as gelatin, 
microcrystalline cellulose, sodium lauryl sulfate, starch, talc, and titanium dioxide, 
and the like, as well as liquid vehicles such as polyethylene glycols (PEGs) and 
vegetable oil. Moreover, enteric coated caplets or tablets of an agent of the invention 
are designed to resist disintegration in the stomach and dissolve in the more neutral 
to alkaline environment of the duodenum. 

The therapeutic agents of the invention can also be formulated as elixirs or 
solutions for convenient oral administration or as solutions appropriate for 
parenteral administration, for instance by intramuscular, subcutaneous or 
intravenous routes. 
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The pharmaceutical formulations of the therapeutic agents of the invention 
can also take the form of an aqueous or anhydrous solution or dispersion, or 
alternatively the form of an emulsion or suspension. 

Thus, the therapeutic agent may be formulated for parenteral administration 
5 (e.g., by injection, for example, bolus injection or continuous infusion) and may be 
presented in unit dose form in ampules, pre-filled syringes, small volume infusion 
containers or in multi-dose containers with an added preservative. The active 
ingredients may take such forms as suspensions, solutions, or emulsions in oily or 
aqueous vehicles, and may contain formulatory agents such as suspending, 
10 stabilizing and/or dispersing agents. Alternatively, the active ingredients may be in 
powder form, obtained by aseptic isolation of sterile solid or by lyophilization from 
solution, for constitution with a suitable vehicle, e.g., sterile, pyrogen-free water, 
before use. 

p These formulations can contain pharmaceutically acceptable vehicles and 

Ul 15 adjuvants which are well known in the prior art. It is possible, for example, to 

yi 

jg prepare solutions using one or more organic solvent(s) that is/are acceptable from 
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the physiological standpoint, chosen, in addition to water, from solvents such as 



Q acetone, ethanol, isopropyl alcohol, glycol ethers such as the products sold under the 

Q 

ffi name "Dowanol", polyglycols and polyethylene glycols, C r C 4 alkyl esters of short- 

y 20 chain acids, preferably ethyl or isopropyl lactate, fatty acid triglycerides such as the 

O 

products marketed under the name "Miglyol", isopropyl myristate, animal, mineral 

and vegetable oils and polysiloxanes. 

The compositions according to the invention can also contain thickening 

agents such as cellulose and/or cellulose derivatives. They can also contain gums 
25 such as xanthan, guar or carbo gum or gum arabic, or alternatively polyethylene 

glycols, bentones and montmorillonites, and the like. 

It is possible to add, if necessary, an adjuvant chosen from antioxidants, 

surfactants, other preservatives, film- forming, keratolytic or comedolytic agents, 

perfumes and colorings. Also, other active ingredients may be added, whether for 
30 the conditions described or some other condition. 
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For example, among antioxidants, t-butylhydroquinone, butylated 
hydroxyanisole, butylated hydroxytoluene and a-tocopherol and its derivatives may 
be mentioned. The galenical forms chiefly conditioned for topical application take 
the form of creams, milks, gels, dispersion or microemulsions, lotions thickened to a 
greater or lesser extent, impregnated pads, ointments or sticks, or alternatively the 
form of aerosol formulations in spray or foam form or alternatively in the form of a 
cake of soap. 

Additionally, the agents are well suited to formulation as sustained release 
dosage forms and the like. The formulations can be so constituted that they release 
the active ingredient only or preferably in a particular part of the intestinal or 
respiratory tract, possibly over a period of time. The coatings, envelopes, and 
protective matrices may be made, for example, from polymeric substances, such as 
polylactide-glycolates, liposomes, microemulsions, microparticles, nanoparticles, or 
waxes. These coatings, envelopes, and protective matrices are useful to coat 
indwelling devices, e.g., stents, catheters, peritoneal dialysis tubing, and the like. 

The therapeutic agents of the invention can be delivered via patches for 
transdermal administration. See U.S. Patent No. 5,560,922 for examples of patches 
suitable for transdermal delivery of a therapeutic agent. Patches for transdermal 
delivery can comprise a backing layer and a polymer matrix which has dispersed or 
dissolved therein a therapeutic agent, along with one or more skin permeation 
enhancers. The backing layer can be made of any suitable material which is 
impermeable to the therapeutic agent. The backing layer serves as a protective 
cover for the matrix layer and provides also a support function. The backing can be 
formed so that it is essentially the same size layer as the polymer matrix or it can be 
of larger dimension so that it can extend beyond the sidfe of the polymer matrix or 
overlay the side or sides of the polymer matrix and then can extend outwardly in a 
manner that the surface of the extension of the backing layer can be the base for an 
adhesive means. Alternatively, the polymer matrix can contain, or be formulated of, 
an adhesive polymer, such as polyacrylate or acrylate/vinyl acetate copolymer. For 
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long-term applications it might be desirable to use microporous and/or breathable 
backing laminates, so hydration or maceration of the skin can be minimized. 

Examples of materials suitable for making the backing layer are films of 
high and low density polyethylene, polypropylene, polyurethane, polyvinylchloride, 
polyesters such as poly(ethylene phthalate), metal foils, metal foil laminates of such 
suitable polymer films, and the like. Preferably, the materials used for the backing 
layer are laminates of such polymer films with a metal foil such as aluminum foil. 
In such laminates, a polymer film of the laminate will usually be in contact with the 
adhesive polymer matrix. 

The backing layer can be any appropriate thickness which will provide the 
desired protective and support functions. A suitable thickness will be from about 10 
to about 200 microns. 

Generally, those polymers used to form the biologically acceptable adhesive 
polymer layer are those capable of forming shaped bodies, thin walls or coatings 
through which therapeutic agents can pass at a controlled rate. Suitable polymers 
are biologically and pharmaceutically compatible, nonallergenic and insoluble in 
and compatible with body fluids or tissues with which the device is contacted. The 
use of soluble polymers is to be avoided since dissolution or erosion of the matrix 
by skin moisture would affect the release rate of the therapeutic agents as well as the 
capability of the dosage unit to remain in place for convenience of removal. 

Exemplary materials for fabricating the adhesive polymer layer include 
polyethylene, polypropylene, polyurethane, ethylene/propylene copolymers, 
ethylene/ethylacrylate copolymers, ethylene/vinyl acetate copolymers, silicone 
elastomers, especially the medical-grade polydimethylsiloxanes, neoprene rubber, 
polyisobutylene, polyacrylates, chlorinated polyethylene, polyvinyl chloride, vinyl 
chloride-vinyl acetate copolymer, crosslinked polymethacrylate polymers (hydro- 
gel), polyvinylidene chloride, poly(ethylene terephthalate), butyl rubber, 
epichlorohydrin rubbers, ethylenvinyl alcohol copolymers, ethylene- 
vinyloxyethanol copolymers; silicone copolymers, for example, polysiloxane- 
polycarbonate copolymers, polysiloxanepolyethylene oxide copolymers, 
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polysiloxane-polymethacrylate copolymers, polysiloxane-alkylene copolymers (e.g., 
polysiloxane-ethylene copolymers), polysiloxane-alkylenesilane copolymers (e.g., 
polysiloxane-ethylenesilane copolymers), and the like; cellulose polymers, for 
example methyl or ethyl cellulose, hydroxy propyl methyl cellulose, and cellulose 
5 esters; polycarbonates; polytetrafluoroethylene; and the like. 

Preferably, a biologically acceptable adhesive polymer matrix should be 
selected from polymers with glass transition temperatures below room temperature. 
The polymer may, but need not necessarily, have a degree of crystallinity at room 
temperature. Cross-linking monomelic units or sites can be incorporated into such 

10 polymers. For example, cross-linking monomers can be incorporated into 
polyacrylate polymers, which provide sites for cross-linking the matrix after 
dispersing the therapeutic agent into the polymer. Known cross-linking monomers 
for polyacrylate polymers include polymethacrylic esters of polyols such as butylene 
diacrylate and dimethacrylate, trimethylol propane trimethacrylate and the like. 

15 Other monomers which provide such sites include allyl acrylate, allyl methacrylate, 
diallyl maleate and the like. 

Preferably, a plasticizer and/or humectant is dispersed within the adhesive 
polymer matrix. Water-soluble polyols are generally suitable for this purpose. 
Incorporation of a humectant in the formulation allows the dosage unit to absorb 

20 moisture on the surface of skin which in turn helps to reduce skin irritation and to 
prevent the adhesive polymer layer of the delivery system from failing. 

Therapeutic agents released from a transdermal delivery system must be 
capable of penetrating each layer of skin. In order to increase the rate of permeation 
of a therapeutic agent, a transdermal drug delivery system must be able in particular 

25 to increase the permeability of the outermost layer of skin, the stratum corneum, 
which provides the most resistance to the penetration of molecules. The fabrication 
of patches for transdermal delivery of therapeutic agents is well known to the art. 

For administration to the upper (nasal) or lower respiratory tract by 
inhalation, the therapeutic agents of the invention are conveniently delivered from 

30 an insufflator, nebulizer or a pressurized pack or other convenient means of 
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delivering an aerosol spray. Pressurized packs may comprise a suitable propellant 
such as dichlorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoroethane, 
carbon dioxide or other suitable gas. In the case of a pressurized aerosol, the dosage 
unit may be determined by providing a valve to deliver a metered amount. 

Alternatively, for administration by inhalation or insufflation, the 
composition may take the form of a dry powder, for example, a powder mix of the 
therapeutic agent and a suitable powder base such as lactose or starch. The powder 
composition may be presented in unit dosage form in, for example, capsules or 
cartridges, or, e.g., gelatine or blister packs from which the powder may be 
administered with the aid of an inhalator, insufflator or a metered-dose inhaler. 

For intra-nasal administration, the therapeutic agent may be administered via 
nose drops, a liquid spray, such as via a plastic bottle atomizer or metered-dose 
inhaler. Typical of atomizers are the Mistometer (Wintrop) and the Medihaler 
(Riker). 

The local delivery of the therapeutic agents of the invention can also be by a 
variety of techniques which administer the agent at or near the site of disease. 
Examples of site-specific or targeted local delivery techniques are not intended to be 
limiting but to be illustrative of the techniques available. Examples include local 
delivery catheters, such as an infusion or indwelling catheter, e.g., a needle infusion 
catheter, shunts and stents or other implantable devices, site specific carriers, direct 
injection, or direct applications. 

For topical administration, the therapeutic agents may be formulated as is 
known in the art for direct application to a target area. Conventional forms for this 
purpose include wound dressings, coated bandages or other polymer coverings, 
ointments, creams, lotions, pastes, jellies, sprays, and aerosols. Ointments and 
creams may, for example, be formulated with an aqueous or oily base with the 
addition of suitable thickening and/or gelling agents. Lotions may be formulated 
with an aqueous or oily base and will in general also contain one or more 
emulsifying agents, stabilizing agents, dispersing agents, suspending agents, 
thickening agents, or coloring agents. The active ingredients can also be delivered 
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via iontophoresis, e.g., as disclosed in U.S. Patent Nos. 4,140,122; 4,383,529; or 
4,05 1 ,842. The percent by weight of a therapeutic agent of the invention present in 
a topical formulation will depend on various factors, but generally will be from 
0.01% to 95% of the total weight of the formulation, and typically 0.1-25% by 
5 weight. 

Drops, such as eye drops or nose drops, may be formulated with an aqueous 
or non-aqueous base also comprising one or more dispersing agents, solubilizing 
agents or suspending agents. Liquid sprays are conveniently delivered from 
pressurized packs. Drops can be delivered via a simple eye dropper-capped bottle, 
10 or via a plastic bottle adapted to deliver liquid contents dropwise, via a specially 
shaped closure. 

3 The therapeutic agent may further be formulated for topical administration in 

™ the mouth or throat. For example, the active ingredients may be formulated as a 

B lozenge further comprising a flavored base, usually sucrose and acacia or tragacanth; 

p 

fj 15 pastilles comprising the composition in an inert base such as gelatin and glycerin or 

fj sucrose and acacia; and mouthwashes comprising the composition of the present 

h 

invention in a suitable liquid carrier. 

The formulations and compositions described herein may also contain other 
ingredients such as antimicrobial agents, or preservatives. Furthermore, the active 
p 20 ingredients may also be used in combination with other therapeutic agents, for 

^ example, bronchodilators. 

In particular, for delivery of a vector of the invention to a tissue such as 
muscle, any physical or biological method that will introduce the vector into the 
muscle tissue of a host animal can be employed. Vector means both a bare 
25 recombinant vector and vector DNA packaged into viral coat proteins, as is well 
known for AAV administration. Simply dissolving an AAV vector in phosphate 
buffered saline has been demonstrated to be sufficient to provide a vehicle useful for 
muscle tissue expression, and there are no known restrictions on the carriers or other 
components that can be coadministered with the vector (although compositions that 
30 degrade DNA should be avoided in the normal manner with vectors). 
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Pharmaceutical compositions can be prepared as injectable formulations or as 
topical formulations to be delivered to the muscles by transdermal transport. 
Numerous formulations for both intramuscular injection and transdermal transport 
have been previously developed and can be used in the practice of the invention. 
5 The vectors can be used with any pharmaceutically acceptable carrier for ease of 
administration and handling. 

For purposes of intramuscular injection, solutions in an adjuvant such as 
sesame or peanut oil or in aqueous propylene glycol can be employed, as well as 
sterile aqueous solutions. Such aqueous solutions can be buffered, if desired, and 
10 the liquid diluent first rendered isotonic with saline or glucose. Solutions of the 
AAV vector as a free acid (DNA contains acidic phosphate groups) or a 
pharmacologically acceptable salt can be prepared in water suitably mixed with a 
surfactant such as hydroxypropylcellulose. A dispersion of AAV viral particles can 
Qj also be prepared in glycerol, liquid polyethylene glycols and mixtures thereof and in 

15 oils. Under ordinary conditions of storage and use, these preparations contain a 

preservative to prevent the growth of microorganisms. In this connection, the sterile 
aqueous media employed are all readily obtainable by standard techniques well- 
known to those skilled in the art. 
^ The pharmaceutical forms suitable for injectable use include sterile aqueous 

p 20 solutions or dispersions and sterile powders for the extemporaneous preparation of 

sterile injectable solutions or dispersions. In all cases the form must be sterile and 
must be fluid to the extent that easy syringability exists. It must be stable under the 
conditions of manufacture and storage and must be preserved against the 
contaminating action of microorganisms such as bacteria and fungi. The carrier can 
25 be a solvent or dispersion medium containing, for example, water, ethanol, polyol 
(for example, glycerol, propylene glycol, liquid polyethylene glycol and the like), 
suitable mixtures thereof, and vegetable oils. The proper fluidity can be maintained, 
for example, by the use of a coating such as lecithin, by the maintenance of the 
required particle size in the case of a dispersion and by the use of surfactants. The 
30 prevention of the action of microorganisms can be brought about by various 



yi s 



O 



43 



antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, 
sorbic acid, thimerosal and the like. In many cases it will be preferable to include 
isotonic agents, for example, sugars or sodium chloride. Prolonged absorption of 
the injectable compositions can be brought about by use of agents delaying 
absorption, for example, aluminum monostearate and gelatin. 

Sterile injectable solutions are prepared by incorporating the AAV vector in 
the required amount in the appropriate solvent with various of the other ingredients 
enumerated above, as required, followed by filtered sterilization. Generally, 
dispersions are prepared by incorporating the sterilized active ingredient into a 
sterile vehicle which contains the basic dispersion medium and the required other 
ingredients from those enumerated above. In the case of sterile powders for the 
preparation of sterile injectable solutions, the preferred methods of preparation are 
vacuum drying and the freeze drying technique which yield a powder of the active 
ingredient plus any additional desired ingredient from the previously sterile-filtered 
solution thereof. 

For purposes of topical administration, dilute sterile, aqueous solutions 
(usually in about 0.1% to 5% concentration), otherwise similar to the above 
parenteral solutions, are prepared in containers suitable for incorporation into a 
transdermal patch, and can include known carriers, such as pharmaceutical grade 
dimethylsulfoxide (DMSO). 

The therapeutic compounds of this invention may be administered to a 
mammal alone or in combination with pharmaceutically acceptable carriers. As 
noted above, the relative proportions of active ingredient and carrier are determined 
by the solubility and chemical nature of the compound, chosen route of 
administration and standard pharmaceutical practice. 

The dosage of the present therapeutic agents which will be most suitable for 
prophylaxis or treatment will vary with the form of administration, the particular 
compound chosen and the physiological characteristics of the particular patient 
under treatment. Generally, small dosages will be used initially and, if necessary, 
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will be increased by small increments until the optimum effect under the 
circumstances is reached. Exemplary dosages are set out in the example below. 

Since AAV has been shown to have a broad host range (for pulmonary 
expression) and persists in muscle, the vectors of the invention may be employed to 
express a gene in any animal, and particularly in mammals, birds, fish, and reptiles, 
especially domesticated mammals and birds such as cattle, sheep, pigs, horses, dogs, 
cats, chickens, and turkeys. Both human and veterinary uses are particularly 
preferred. 

The gene being expressed can be either a DNA segment encoding a protein, 
with whatever control elements (e.g., promoters, operators) are desired by the user, 
or a non-coding DNA segment, the transcription of which produces all or part of 
some RNA-containing molecule (such as a transcription control element, +RNA, or 
anti-sense molecule). 

Muscle tissue is a very attractive target for in vivo gene delivery and gene 
therapy, because it is not a vital organ and is very easy to access. If a disease is 
caused by a defective gene product which is required to be produced and/or secreted, 
such as hemophilia, diabetes and Gaucher' s disease, and the like, is muscle is a good 
candidate to supply the gene product if the appropriate gene can be effectively 
delivered into the cells. 

Different vectors, such as naked DNA, adenovirus and retrovirus, have been 
utilized to directly deliver various transgenes into muscle tissues. However, neither 
system can offer both high efficiency and long-term expression. For naked plasmid 
DNA directly delivered into muscle tissue, the efficiency is not high. There are only 
a few cells near the injection site that can maintain transgene expression. 
Furthermore, the plasmid DNA in the cells remains as non-replicating episomes, i.e., 
in the unintegrated form. Therefore, it will be eventually lost. For adenovirus 
vector, it can infect the non-dividing cells, and therefore, can be directly delivered 
into the mature tissues such as muscle. However, the transgene delivered by 
adenovirus vectors are not useful to maintain long-term expression for the following 
reasons. First, since adenovirus vectors still retain most of the viral genes, they are 
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not very safe. Moreover, the expression of those genes can cause the immune 
system to destroy the cells containing the vectors (see, for example, Yang et al. 
1994, Proc. Natl. Acad. Sci. 91 :4407-441 1). Second, since adenovirus is not an 
integration virus, its DNA will eventually be diluted or degraded in the cells. Third, 
due to the immune response, adenovirus vector could not be repeatedly delivered. 
In the case of lifetime diseases, this will be a major limitation. For retrovirus 
vectors, although they can achieve stable integration into the host chromosomes, 
their use is very restricted because they can only infect dividing cells while a large 
majority of the muscle cells are non-dividing. 

Adeno-associated virus vectors have certain advantages over the above- 
mentioned vector systems. First, like adenovirus, AAV can efficiently infect non- 
dividing cells. Second, all the AAV viral genes are eliminated in the vector. Since 
the viral-gene-expression-induced immune reaction is no longer a concern, AAV 
vectors are safer than Ad vectors. Thirds, AAV is an integration virus by nature, 
and integration into the host chromosome will stably maintain its transgene in the 
cells. Fourth, AAV is an extremely stable virus, which is resistant to many 
detergents, pH changes and heat (stable at 56°C for more than an hour). It can be 
lyophilized and redissolved without losing its activity. Therefore, it is a very 
promising delivery vehicle for gene therapy. 

The invention will be further described by, but is not limited to, the 
following examples. 

Example 1 

Materials and Methods 
Construction of rAAV Shuttle Vector. 

A recombinant AAV shuttle vector (AV.GFP3ori) which contained a GFP 
transgene cassette, bacterial ampicillin resistance gene, and bacterial origin of 
replication, was generated from a m-acting plasmid (pCisAV.GFP3ori). 
Expression of the GFP gene was directed by the CMV promoter/enhancer and SV40 
poly-adenylation sequences. pCisAV.GFP3ori was constructed with pSub201 



46 



derived ITR elements (Samulski et al, 1987) and the intactness of ITR sequences 
was confirmed by restriction analysis with Smal and PvuII, and by sequencing. 
Recombinant AAV stocks were generated by co-transfection of pCisAV.GFP3ori 
and pRep/Cap together with co-infection of recombinant Ad.CMVlacZ in 293 cells 
(Duan et al., 1997). Following transfection of forty 150 mm plates, cells were 
collected at 72 hours by centrifugation and resuspended in 12 ml of buffer (10 mM 
Tris pH 8.0). Virus was released from cells by three cycles of freeze/thawing and 
passaged through a 25 gauge needle six times. Cell lysates were then treated with 
1.3 mg/ml DNase I at 37°C for 30 minutes and 1% deoxycholate (g/ml final) and 
0.05% trypsin (g/ml final) at 37 °C for 30 minutes. Samples were then placed on ice 
for 10 minutes and centrifuged to remove large particulate material at 3,000 rpm for 
30 minutes. 

rAAV was purified by isopycnic density gradient centrifugation in CsCl 
(r=1.4) in a SW55 rotor for 72 hours at 35K. Peak fractions of AAV were combined 
and re-purified through two more rounds of CsCl centrifugation, followed by 
heating at 58 °C for 60 minutes to inactivate all contaminant helper adenovirus. 
Typically, this preparation gave approximate AAV titers of 10 12 DNA molecules/ml 
and 2.5 x 10 8 GFP-expressing units/ml. Recombinant viral titers were assessed by 
slot blot and quantified against pCisAV.GFP3ori controls for DNA particles. 
Functional transducing units were quantified by GFP transgene expression in 293 
cells. The absence of helper adenovirus was confirmed by histochemical staining of 
rAAV infected 293 cells for beta-galactosidase, and no recombinant adenovirus was 
found in 10 10 particles of purified rAAV stocks. The absence of significant wtAAV 
contamination was confirmed by immunocytochemical staining of rAAV/ Ad 
co-infected 293 cells with anti-Rep antibodies. These studies, which had a 
sensitivity of 1 wtAAV in 10 10 rAAV particles, demonstrated an absence of Rep 
staining as compared to pRep/Cap plasmid transfected controls. 
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Isolation and Structural Evaluation of AAV Circular Intermediates From Hela Cells. 

Hela cells were grown in 35 mm dishes in DMEM media supplemented with 
10% fetal calf serum (FCS). Cells were infected in the presence of 2% FCS at 80% 
confluency with recombinant AV.GFP3ori (MOI=1000 particles/cell, 1 x 10 9 total 
particles/plate) and Hirt DNAs isolated as described by Duan et al. (1997) at 6, 12, 
24, 48, and 72 hours post-infection. In experiments analyzing the effects of 
adenovirus, plates were co-infected with Ad.CMVLacZ (MOI=5000 particles/cell) 
in the presence of 2%FCS/DMEM. Zero hour controls were generated by mixing 
10 9 particles of AV.GFP3ori with cell lysates prior to Hirt DNA preparation. Hirt 
DNA isolated at each time point was used to transform E. coli SURE cells 
(Stratagene, La Jolla, CA.). Typically, 1/10 of the Hirt DNA preparation was used 
to transform 40 ml of competent bacteria by electroporation. The resultant total 
number of bacterial colonies was quantified for each time point and the structure of 
circular intermediates was evaluated for greater than 20 plasmid clones for each 
time point from two independent experiments. Structural determinations were based 
on restriction enzyme analysis using PstI, SphI, Asel single and double digests 
together with Southern blotting against GFP, stuffer, and ITR probes. 
Evaluation of E2a and GFP gene expression in Hela cells. 

E2a gene expression was evaluated by immunofluorescent staining of Hela 
cells superinfected with El-deleted Ad.CMVlacZ (MOI= 0, 500, 5000 
particles/cell). Briefly, cells were fixed in methanol at -20°C for 10 minutes 
followed by air drying. Cells were then incubated at room temperature with 
hybridoma supernatant against Ad5 72kd DBP (Reich et al, 1983), followed by goat 
anti-mouse-FITC antibody (5 mg/ml) for 30 minutes at room temperature. In 
studies evaluating augmentation of AAV GFP transgene expression by adenovirus, 
Hela cells were harvested at 24 or 72 hours post-infection by trypsinization, 
resuspended in 2%FCS/PBS and evaluated by FACS analyses. Thresholds were set 
using uninfected controls and the percentage and/or the average relative fluorescent 
intensity was determined by sorting greater than 10 5 cells per experiment condition. 
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Sequence Analysis of AAV Circular Intermediates. 

Sequence analysis of the ITR array within circular intermediates was 
performed using primers ELI 18 (5 '-CGGGGGTCGTTGGGCGGTCA-3 '; SEQ ID 
NO:l) and EL230 (5 ' -GGGCGGAGCCT ATGGAAAA-3 ' ; SEQ ID NO:2) which 
5 are nested to 5' and 3' ITR sequences, respectively. Both circular and linearized 
(with Smal which cuts within ITR sequences) plasmids were sequenced. 
Results 

Construction of rAAV Shuttle Vector and Isolation of Circular Intermediates. 

To circumvent the inability to retrieve pre-integration intermediates or as 
10 stable episomal forms resistant to nuclease digestion, an alternative strategy was 
developed to "trap" circular intermediates using a recombinant AAV shuttle vector. 
Recombinant AV.GFP3ori virus (Figure IB) was generated from a m-acting 
plasmid (pCisAV.GFP3ori, Figure 1 A) by co-transfection in 293 cells with 
trans-acting plasmids encoding Rep and Cap viral genes. This viral vector 
15 (AV.GFP3ori) encoded the green fluorescent protein (GFP) reporter gene, a 

bacterial origin of replication (ori), and the bacterial ampicillin-resistance gene. Ori 
and ampicillin-resistance sequences encoded in this virus allow for the rescue of 
circular AAV genomes formed during the transduction process. 
P To test this strategy, Hela cells were infected with AV.GFP3ori (MOI=1000 

CP ■ 

Q 20 particles/cell) and the abundance of circular intermediates was evaluated following 

^ transformation of low molecular weight cellular Hirt DNA into E. coli SURE cells. 

The presence of circular intermediates was inferred by retrievable 
ampicillin-resistant bacterial colonies. Structural features of circular intermediates 
were determined by restriction enzyme analysis and Southern blotting with various 
25 regions of the provirus, including GFP, Stuffer, and ITR sequences. 

The predominant circular form isolated after transduction of Hela cells with 
AV.GFP3ori consisted of 4.7 kb monomer-sized molecules (Figure 1C). SphI 
digestions of these circular intermediates yielded characteristic 300 bp bands which 
hybridized to an ITR probe on Southern blots (Figure 2A). PstI, SphI, Asel single 
30 and double digests together with Southern blot analysis using GFP, Stuffer (data not 



4=* 



49 



shown), and ITR (Figure 2A) probes confirmed the structure of the circular 
intermediates as head-to-tail monomer genomes (Figure 1C). In particular, PstI 
digests together with ITR Southern blots distinguish these head-to-tail circular 
intermediates from head-to-head or tail-to-tail circular dimers. Similar results 
5 obtained with studies on AV.GFP3ori infected 293 cells and primary fibroblasts 
have confirmed that monomer head-to-tail circular intermediates were also the most 
abundant form in these cell types. 

Because the predicted molecular weight of an intact head-to-tail ITR SphI 
fragment would be approximately 360 bp, an anomalous migration in agarose gels 
10 might be due to the high secondary structure of inverted repeats within ITRs. To 
this end, the head-to-tail orientation of the ITRs, as predicted by Southern blot 
q analysis, was confirmed using several sequencing strategies. First, the SphI ITR 

|| hybridizing fragment of a circular intermediates was subcloned into a secondary 

OS plasmid vector and sequenced with primers outside the ITR cloned sequences. 

BSCS 

yf| 15 These findings confirmed the head-to-tail orientation of ITRs. Additionally, 

sequence was obtained directly from six monomer circular intermediate clones using 
s primers internal to both the 5' and 3' ITRs (Figure 2C). In these studies, circular 

^ intermediates were digested with Smal and the linear 4.6 kb plasmid was gel 

0 isolated prior to sequencing. Smal digestion (which relaxed the secondary structure 

01 

p 20 of ITRs) was necessary to obtain sequence information within the ITRs. The 

Q 

sequencing results presented in Figure 2C confirmed the orientation of head-to-tail 
ITR arrays in these intermediates. 

Interestingly, sequencing also revealed several consistent base pair (bp) 
changes in four of the six clones analyzed (Figure 2C). These four clones (p79, 

25 p81, p87, and p88) had consistent two bp changes within the D-sequence [G->A 
(122 bp) and A->G (125 bp)], which always occurred together with the bp 
alterations in the p5 promoter [A->G (1 14 bp) and A->C (115 bp)]. No other 
consistent bp changes were noted except for two clones (p79 and p88) which 
demonstrated mutations just outside the 3TTR D-sequence [T->G (381 bp) and 

30 T->C (383 bp)]. 
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Although head-to-tail circular intermediates were the most abundant forms 
present in Hirt DNA from rAAV infected Hela cells, several less frequent structures 
were also detected. These included monomer circularized AAV genomes with one 
(pi 90) and three ITRs (p345) arranged in a head-to-tail fashion as well as several 
clones with an unknown structure lacking complete ITRs (p340) (Figure 2A). Such 
diversity within the ITR array may represent homologous recombination in vivo or 
in bacteria during amplification. However, previous studies demonstrating similar 
variations in ITR sequences of head- to- tail integrated genomes, suggest that such 
changes in the length of the ITR array may occur in vivo (Duan et al., 1997) 
Additionally, less frequent head-to-tail circularized multimer forms were predicted 
based on the variation in migration patterns of uncut plasmids which gave identical 
restriction patterns. Results shown in Figure 2B confirmed the existence of 
monomer and dimer head-to-tail circular intermediates using partial digestion with 
an enzyme which cuts once in the AAV genome (Asel). Cumulative analysis of 
greater than 200 independently isolated circular intermediates from Hela cells 
demonstrated that head-to-tail circular AAV genomes occurred in greatest 
abundance as monomers (92%) and less frequently as multimers of greater than one 
genome (8%). 

To establish that head-to-tail circular intermediates were formed in vivo and 
not by non-specific bacterial recombination of linear AAV genomes present in the 
Hirt DNA, a set of reconstitution experiments was performed by which the same 
number of rAAV particles used for infection experiments were spiked into Hela cell 
lysates prior to Hirt preparations. In these studies, background bacterial 
amplification of Hirt DNA spiked with linear rAAV genomes was negligible 
(Figure 3D) and of the few isolated colonies obtained from these controls, none had 
a predicted head-to-tail structure as assessed by Southern blot restriction enzyme 
analysis (Figure 3E). Additionally, reconstitution experiments transforming 
bacterial with linearized dsDNA AAV genomes did not give rise to significant 
levels of replication competent plasmids or the characteristic head-to-tail structure 
associated with AAV circular intermediates. These findings confirm that circular 



51 



intermediates do not likely arise from non-specific recombination or ligation events 
with either ssDNA or dsDNA linear AAV genomes in bacteria. Additional control 
experiments, demonstrating the lack of stuffer hybridizing sequences in AAV 
circular intermediates by Southern blotting, also confirm that these structures do not 
arise from contamination of viral stocks with pCisAV.GFP3ori plasmid. 
The formation of head- to-tail circular AAV intermediates is auemented by 
superinfection with El -deleted adenovirus. 

Many aspects of the wtAAV growth cycle are affected by helper adenovirus, 
including AAV DNA replication, transcription, splicing, translation, and virion 
assembly. Such studies have provided concrete evidence that a subset of Ad early 
gene products provide helper functions for the wtAAV lytic cycle, including: El a, 
Elb, E2a, E4 ORF6 and VA1 RNA (Muzyczka, 1992). In this regard, one of the 
most critical factors which is required for AAV replication is the 34 kD E4 protein 
(ORF6). Recent observations on the helper function of Ad in rAAV transduction 
have also demonstrated that Ad E4 ORF6 is essential for the augmentation of rAAV 
transgene expression seen with adenovirus co-infection (Ferrari et al., 1996; Fisher 
et al., 1996). According to these reports, the rate-limiting step enhanced by these 
adenoviral proteins is the conversion of single stranded AAV genomes to double 
stranded forms. 

Studies evaluating the kinetics of rAAV circular intermediate formation 
demonstrated a time-dependent increase in abundance which peaked at 24 hours 
post-infection in Hela cells and coincided with the onset of GFP transgene 
expression (Figure 3). To better understand the cellular mechanisms associated with 
AAV circular intermediate formation, the effects of adenoviral co-infection on this 
process were evaluated. The extent of transgene expression and circular 
intermediate formation in AV.GFP3ori infected Hela cells with or without 
co-infection with El -deleted recombinant adenovirus was compared. 

Although El -deleted adenoviruses are severely handicapped in their ability 
to synthesize viral gene products, at high MOIs of >5000 significant E2a protein 
expression was noted (Figure 3 A). As an indicator of transgene expression, the 
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abundance and average relative intensity of GFP positive cells was determined 
against mock infected controls by fluorescent microscopy (Figure 3B) and FACS 
analysis (Figure 3C) at 72 hours post-infection. In accord with previous reports 
demonstrating augmentation in rAAV transgene expression by adenovirus (Ferrari 
et al., 1996; Fisher et al., 1996), the extent of GFP transgene expression was 
dramatically increased at doses of adenovirus which led to viral gene expression 
(MOI>5000; Figures 3A-C). Additionally, persistence of rAAV transgene 
expression was also augmented by co-infection with El -deleted adenovirus, as 
determined by GFP-expressing colony formation following serial passages 
(Figure 3C). 

If circular intermediates represent a molecular form of rAAV important for 
efficient and/or persistent transgene expression, augmentation of rAAV transgene 
expression by adenovirus might also modulate circular intermediate formation. In 
these studies, the abundance and time course of AAV circular intermediate 
formation was evaluated following superinfection with Ad.CMVLacZ. Results 
from these experiments are shown in Figure 3D, which represents the total number 
of bacterial colonies (per 35 mm plate) obtained following transformation of E. coli 
with Hirt DNA isolated from Hela cells infected with AV.GFP3ori (1000 DNA 
particles/cell) with or without co-infection with Ad.CMVlacZ (5,000 particles/cell). 
An MOI of 5000 Ad particles/cell was chosen for these experiments since this level 
of adenovirus led to minimal cytopathic effect (CPE) with high levels of E2a 
expression. 

These studies demonstrated a nearly 2-fold augmentation by Ad.CMVLacZ 
in the total abundance of AAV rescued plasmid intermediates in E. coli (Figure 3D). 
Southern blot restriction enzyme analysis demonstrated that the predominant forms 
in both the presence and absence of adenovirus were head-to-tail monomer circular 
intermediates containing the diagnostic 300 bp ITR fragment following SphI 
digestion (Figure 3E). Additionally, results demonstrated that adenovirus 
co-infection led to an earlier time of onset and increased stability of AAV 
head-to-tail monomer circular intermediates (Figures 3E and F). For example, at 
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6 hours post-infection, head-to-tail circular intermediates were only present in Hela 
cells co-infected with adenovirus. Furthermore, a decline in the percentage of 
head-to-tail circular intermediate clones was seen at 48-72 hours post-AAV 
infection in the absence of adenovirus. In contrast, this decline was significantly 
blunted by the presence of helper adenovirus (Figure 3F). Based on these findings, 
it was concluded that certain adenoviral proteins produced by superinfection with 
El -deleted adenovirus were capable of modulating circular intermediates formation 
and stability during rAAV transduction. 
Discussion 

In the present study, it was shown that circularization of linear AAV 
genomes occurs during rAAV transduction. Circularization appears to 
predominately occur as head-to-tail monomer genomes. However, the existence of 
less abundant circular multimer forms suggests that recombinational events 
subsequent to the initial infection may drive concatamerization of circular genomes. 
The diversity in the length of ITR arrays found within circular intermediates (i.e., 
1-3 ITRs) also supports the notion that these forms may be highly recombinagenic. 
Of mechanistic interest in the formation of circular intermediates is the uniformity 
of mutations observed in the D-sequences and nearby p5 promoter region and the 
confinement of these mutations to the 5 '-ITRs. Although the etiology of these base 
pair changes is unknown, their uniformity suggests that they may have a direct role 
in the formation of circular intermediates and in increased stability. Recent 
findings, which suggest that an endogenous host single strand D-sequence binding 
protein is important in rAAV transduction, lend support to the potential involvement 
of this sequence in circular intermediate formation (Wang et al., 1997; Qing et al., 
1998). Furthermore, it remains to be determined whether the in vivo formation of 
AAV circular intermediates occurs through the circularization of single or double 
stranded AAV genomes. 

By analogy, retroviral transduction intermediates have striking similarities to 
the current findings with AAV. Three DNA forms have been isolated following 
retroviral infection, including linear DNA with long terminal repeats (LTRs) at both 
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ends, circular DNA with one LTR, and circular DNA with multiple LTRs 
(Panganiban, 1985). Although it is disputed which of these forms are the direct 
precursor to integration, the existence of circular retroviral genomes which also 
have similar repeat regions at the ends of their genomes suggests the potential for 
common mechanisms with the formation of AAV circular intermediates. These 
AAV circular intermediates could act as integration precursors and/or stable 
episomal genomes. 

The head-to-tail ITR structures found in AAV circular intermediates are 
most characteristic of latent integrated AAV genomes. In contrast, lytic phases of 
AAV growth are typically associated with head-to-head and tail-to-tail replication 
form genomes. Hence, it is likely that circular intermediates represent a latent aspect 
of the AAV life cycle. The finding that co-infection with adenovirus leads to 
increased abundance and stability of AAV circular intermediates suggests a novel 
link between adenoviral helper functions and latent infection of AAV. 

Aspects of inverted head-to-tail ITRs, which include palindromic hairpins 
similar in structure to "Holliday-like" junctions, might impart recombinagenic 
activity which aids in viral integration. Such Holliday junctions have been shown to 
play critical roles in directing homologous recombination in bacteria through the 
processing of recombination intermediates by RuvABC proteins (West, 1997; Lee et 
al., 1998). Interestingly, a mammalian endonuc lease, analogous to bacterial RuvC 
resolvase, has also been isolated from cell lines (Hyde et al., 1994). Despite the 
theoretical considerations which might suggest that circular AAV genomes have 
characteristics of preintegration intermediates, a study with recombinant retrovirus 
has demonstrated that palindromic LTR-LTR junctions of MMLV are not efficient 
substrates for proviral integration (Lobel et al., 1989). Nonetheless, circular AAV 
genomes have been previously proposed as integration intermediates based on 
proviral structure (Linden et al., 1996). 
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Example 2 

Methods 

Production of rAAV Shuttle Vector. 

The c/s-acting plasmid (pCisAV.GFP3ori) used for rAAV production was 
generated by subcloning the Bspl201/Not I fragment (743 bp) of the GFP transgene 
from pEGFP-1 (Clontech) between the CMV enhancer/promoter and SV40polyA by 
blunt-end ligation. A 2.5 kb cassette containing beta-lactamase and bacterial 
replication origin from pUC19 was blunt ligated down-stream of GFP reporter 
cassette. The ITR elements were derived from pSub20 1.2 The entire plasmid 
contains a 4.7 kb AAV component flanked by a 2 kb stuffer sequence. The integrity 
of ITR sequences was confirmed by restriction analysis with Smal and PvuII, and by 
direct sequencing using a modified di-deoxy procedure which allowed for complete 
sequence through both 5' and 3' ITRs. Recombinant AAV stocks were generated by 
co-transfection of pCisAV.GFP3ori and pRep/Cap together with co-infection of 
recombinant Ad.CMVlacZ in 293 cells. The rAV.GFP3ori virus was subsequently 
purified through 3 rounds of CsCl banding as described in Duan et al., 1997. The 
typical yields from this viral preparation were 1012 DNA molecules/ml. 

DNA titers were determined by viral DNA slot blot hybridization against 
GFP 32 P-labeled probe with copy number plasmid standards. The absence of helper 
adenovirus was confirmed by histochemical staining of rAAV infected 293 cells for 
beta-galactosidase, and no recombinant adenovirus was found in 10 10 particles of 
purified rAAV stocks. The absence of significant wtAAV contamination was 
confirmed by immunocytochemical staining of rAAV/Ad co-infected 293 cells with 
anti-Rep antibodies. Transfection with pRep/Cap was used to confirm the 
specificity of immunocytochemical staining. No immunoreactive Rep staining was 
observed in 293 cells infected with 10 10 rAAV particles. 
Isolation of AAV Circular Intermediates From Muscle. 

The tibialis anterior muscle of 4-5 week old C57BL/6 mice were infected 
with AV.GFP3ori (3 X 10 10 particles) in Hepes buffered saline (30 ^1). GFP 
expression was analyzed by direct immunofluorescence of freshly excised tissues 
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and/or in formalin-fixed cryopreserved tissue sections in four independently injected 
muscles harvested at 0, 5, 10, 16, 22 and 80 days post-infection. Tissue sections 
were counter-stained with propidium iodide to identify nuclear DNA. Hirt DNA 
(Hirt, 1967) (20 ml per muscle sample) was isolated from at least three independent 
muscle specimen for each time point and used to transform E. coli SURE cells using 
3 ml of Hirt with 40 ml of electrocompetent bacterial (approximately 1 x 10 9 cfu/ug 
DNA, Strategene Inc.). The resultant total number of bacterial colonies was 
quantified for each time point and the abundance of head-to-tail circular 
intermediates was evaluated for each time point (> 20 bacterial clones analyzed) by 
PstI, Asel, SphI, and Pstl/Asel digestion, and confirmed by Southern blot analysis 
using ITR, GFP and stuffer probes. The head-to-tail configuration in typical clones 
were also confirmed by dideoxy sequencing using primers ELI 18 
(5 '-CGGGGGTCGTTGGGCGGTCA-3 ' ; SEQ ID NO:l) and EL230 
(5 '-GGGCGGAGCCTATGGAAAA-3 '; SEQ ID NO:2) which are nested to 5 ; and 
3 ' ITR sequences, respectively. Zero hour controls were generated by mixing 3 x 
10 10 particles of AV.GFP3ori with control uninfected muscle lysates prior to Hirt 
DNA preparation. As described in Table 1, a number of additional controls for were 
performed to rule out non-specific recombination of linear AAV genomes in 
bacteria as a source for isolated circular intermediates. 
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Table 1. Control Experiments for Rescue of Circular Intermediates in Bacteria 



Type of Input 
DNA 


Sonrcp of DNA 


Number of 
Molecules 


Number of 
Amp Resistant 
Bacterial 
(^ninnies 


Presence of 
Head-to-Tail 

Circular 
Intermediates 

e 


Purified rAVV 


Hirt from Infected 
Muscle (22 day) 


3x10'° 


approximately 
5 x 10 3 


Yes 


Purified rAAV 


Virus 
reconstituted into 
Uninfected 
Muscle Hirt a 


3xl0 10 


0 


No 


Linear ssDNA 
Encompassing 
rAAV 
Genome b 


Isolated from 
Purified Virus 


3 x 10 10 


2 


No 


Linear dsDNA 
Encompassing 
Entire rAAV 
Genome 


Isolated from 
proviral plasmid 
(HindIII/PvuII) c 


3 x 10 10 


3 


No 


Linear dsDNA 
Encompassing 
Entire rAAV 
Genome 
+ ligase d 


Isolated from 
proviral plasmid 
(Hindlll/PvuII) 


3xl0 10 


>6 x 10 3 


Yes 



a Purified virus was reconstituted into muscle homogenates prior to preparation of 
Hirt DNA. 



b Viral DNA predominantly contained single stranded genomes as evident by 
Southern blot analysis against with ITR probe. However, small amount of dsDNA 
AAV genomes also existed and are likely due to reannealing of single stranded 
genomes during preparation. Purified viral DNA concentrations were determined by 
OD 260 and 75 ng representing approximately 3 x 10 10 viral genomes were used for 
transformation of bacteria. 

0 Hindlll/PvuII digestion was used to remove the entire rAAV genome from 
pcisAV.GFP3ori. Hindlll and PvuII leave 10 and 0 bps of flanking sequence 
outside the 5' and 3' ITRs, respectively. The linear dsDNA fragment (4.7 kb) was 
gel isolated following blunting with T4 DNA polymerase and the DNA 
concentration determined by OD 260 . One hundred and fifty ng of linear fragment 
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representing approximately 3 x 10 10 viral genomes were used for transformation of 
bacteria. 

d Linear dsDNA viral genomes (Hindlll/PvuII blunted fragment) were treated with 
5 T4 DNA ligase prior to transformation of bacteria. 

e The presence of head-to-tail circular AAV intermediates were confirmed by 
restriction enzyme digestion (Asel, PstI, and SphI) and Southern blotting against 
ITR probe. 

10 

Fractionation of muscle Hirt DNA preparations. 

Preparative-scale fractionation of the muscle Hirt DNA was performed by 
1% agarose gel electrophoresis using the Bio-Rad Mini Prep Cell (Catalog 
#170-2908). A 4.5 ml (10.5 cm) tubular gel containing 1 x TBE buffer was poured 
2j 15 according to manufacturer's specification. A total of 20 ml Hirt preparation from 

one entire muscle sample was loaded on top of the gel. Electrophoresis was carried 
out at a constant current of 10 mA over a period of 5 hours. Sample eluent was 
drawn from the preparative gel apparatus by a peristaltic pump at a rate of 
100 ml/min and eluted into a fraction collector at 250 ml/fraction. The collected 
20 DNA was subsequently concentrated by standard ethanol precipitation and used to 
transform SURE bacterial cells by electroporation as described above. 
In vitro Persistence of AAV Circular Intermediates . 

Transgene expression and persistence of AAV circular intermediate plasmid 
clones were evaluated following transient transfection in Hela and 293 cells. 
25 Subconfluent monolayers of Hela cells in 24-well dishes were transfected with 
0.5 mg of either AAV circular intermediates (p81 or p87) or pCMVGFP using 
Lipofectamine (Gibco BRL Inc.). The cultures were then incubated for 5 hours in 
serum free DMEM followed by incubation in DMEM supplemented with 10% fetal 
bovine serum. All plasmid DNA samples used for transfections were spiked with 
30 pRSVlacZ (0.5 mg) as an internal control for transfection efficiency. At 48 hours 
post-transfection, cells were passaged at a 1:10 dilution and allowed to grow to 
confluency (day 5), at which time GFP clones were quantified for size and 
abundance using direct fluorescent microscopy. The percent of 
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beta-galactosidase-expressing cells was also quantified at this time point by X-gal 
staining. At 5 days, cells were passaged an additional time (1:15 dilution) GFP 
clones were quantified again at day 10. The persistence of plasmid DNA at 
passage-5, 7, and 10 days post-transfection was evaluated by Southern blot analysis 
of total cellular DNA using 32 P-labeled GFP probes. To determine whether the 
head-to-tail ITR array within circular intermediates was responsible for increases in 
the persistence of GFP expression, the head-to-tail ITR DNA element was 
subcloned into the pGL3 luciferase plasmid to generate pGL3(ITR). The 
head-to-tail ITR DNA element was isolated from a monomer circular intermediate 
(p81) by Aatll and Haell double digestion and subsequently inserted into the Sail 
site of pGL3 (Promega) by blunt ligation. The resultant plasmid pGL3(ITR) 
contains the luciferase reporter and head-to-tail ITR element 3 ' to the polyA site. 
The integrity of the ITR DNA element within this plasmid was confirmed by 
sequencing. The persistence of transgene expression from pGL3(ITR) was 
compared to that of pGL3 by luciferase assays on transiently transfected Hela cells 
as described above and analyzed at 10 days (passage-2). Transfection efficiencies 
were normalized using a dual renilla luciferase reporter vector (pRLSV40, 
Promega). 
Results 

AAV Circular Intermediates Represent Stable Episomal Forms of Viral DNA 
Associated with Long-term Persistence of Transgene Expression in Muscle. 

To evaluate the molecular characteristics of rAAV genomes in muscle, a 
rAAV shuttle viral vector (AV.GFP3ori) was utilized which harbors an ampicillin 
resistance gene, bacterial origin of replication, and GFP reporter gene (Figure 1A). 
This recombinant virus was used to evaluate the presence of circular intermediates 
by bacterial rescue of replication competent plasmids. In these studies, delivery of 
AV.GFP3ori (3 x 1010 particles) to the tibialis muscle of mice led to GFP transgene 
expression which peaked at 22 days and remained stable for at least 80 days 
(Figure 4A). These results confirmed previous successes in rAAV mediated gene 
transfer to muscle (Kessler et al., 1996; Herzog et al., 1997; Xiao et al., 1996; Clark 
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et al., 1997; Fisher et al., 1997). The formation of circular intermediates was 
evaluated by E. coli transformation of Hirt DNA harvested from muscle at 0, 5, 10, 
16, 22, and 80 days post-infection with AV.GFP3ori. 

In these muscle samples, circular intermediates were found to have a 
5 characteristic head- to-tail structure with 1-2 ITR repeats. The most abundant form 
included two inverted ITRs within a circularized genome (Figure 4B, clone pi 7). 
This figure also depicts a less frequent form (< 5%) of circular intermediates 
observed, p439, with undetermined structure. When this type of replication 
competent plasmid was seen, it was not included in the quantification of 
10 head-to-tail circular intermediates since its structure could not be conclusively 
determined. The total abundance of muscle Hirt derived head-to-tail circular 
intermediates (with 1-2 ITRs) demonstrated a time-dependent increase that peaked 
with transgene expression at 22 days and slightly decreased by day 80 (Figure 5 A). 
Increased diversity in the length of ITR arrays within circular intermediates was 
IP 15 seen at longer time points. For example, Figure 5B demonstrates several isolated 

*fj circular intermediates with 1-3 ITRs isolated from 80 days muscle Hirt samples, 

e This is in contrast to the more uniform structure of circular intermediates with two 

J- ITRs in a head-to-tail conformation at 5-22 days post-infection. 

P To evaluate the potential for artifactual rescue of linear rAAV genomes by 

y 20 recombination in bacteria, several control experiments were performed. First, 

3 

uninfected control muscle Hirt preparations, spiked with an equal amount of rAAV 
virus used for in vivo infection of muscles, failed to give rise to replicating plasmids 
following transformation of Exoli. Second, when a blunted linear double stranded 
Hindlll/PvuII fragment isolated from pcisAV.GFP3ori (encompassing the entire 

25 rAAV genome) was used to transform bacteria, no ampicillin resistant bacterial 
colonies were obtained. The addition of T4 ligase to this fragment, however, led to 
significant numbers of bacterial colonies. Third, when purified single stranded 
rAAV DNA was used for transformation, no bacterial colonies were obtained. As 
summarized in Table 1, these results confirm that in the absence of productive 

30 infection, rAAV genomes themselves are incapable of recombining into replication 
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competent plasmids in bacteria. Hence, in vivo circularization of rAAV genomes is 
a prerequisite for rescuing autonomously replicating plasmids in E. coli with this 
shuttle vector. 

Molecular weight of circular intermediates suggest a conversion from monomer to 
multimer forms over time. 

To further characterize the circular intermediates isolated from muscles, Hirt 
samples from 22 days and 80 days post-infected muscles were size fractionated by 
continuous-flow gel electrophoresis (BioRad). As shown in Figure 6 , the majority 
of circular intermediates at 22 days post-infection size fractionated at a molecular 
weight of less than 3 Kbp. Very few clones were isolated from fractions between 
3 to 5 kb and no clones were obtained from fractions larger than 5 kb at this time 
point. Furthermore, this size fractionated molecular weight of in vivo Hirt derived 
circular intermediates at 22 day time points correlated with that of head-to-tail 
monomer undigested circular intermediate plasmids rescued in bacteria from this 
same time point (approximately 2.5 kb). These data suggest that at early time points 
post-infection in muscle, the predominant form of circular intermediates likely 
occurs as monomer genomes. The lower mobility of this fraction as compared to 
replication form monomer (Rfm=4.7 kb) and dimer (Rfd=9.4 kb) genomes provides 
indirect evidence that these forms are not responsible for rescued plasmids in these 
Hirt samples. Interestingly, when 80 day muscle Hirt samples were size 
fractionated, more clones were retrieved from higher, molecular weight fractions 
ranging from 3-12 kb (Figure 6). This shift in the molecular weight of circular 
intermediates indicates the potential for recombination between monomer forms in 
the generation of large circular multimer genomes. Such concatamerization has 
been previously observed in muscle and has traditionally been hypothesized to 
involve linear integrated forms of the AAV genome (Herzog et al., 1997; Xiao et al., 
1996; Clark et al., 1997; Fisher et al., 1997). This data sheds new light on the 
molecular characteristics of these persistent AAV genomes and suggests that they 
are in fact circular and episomal. Based on yields of retrievable circular plasmids 
reconstituted in Hirt DNA, the efficiency of bacterial transformation, and the initial 
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innoculum of virus, we estimate that approximately 1 in 400 viral DNA particles 
circularize following infection in muscle (Table 2). 



Table 2. Yield of Circular Intermediate Isolation from Hirt DNA 



Bacterial 
Transformation 


Starting Number 
of Plasmid or 
AAV Genomes 


Actual Number 
of Amp r cfu 


Adjusted 
Yield 


Hirt DNA from rAAV 
Infected Muscle a 


3 x 10 10 molecules 


5x 10 3 cfu 


5xl0 5 cfu e 


Hirt DNA + 230 ng 
LacZ Plasmid b ' c 


3 x 10 10 molecules 


2x 10 6 cfu d 


2 x 10 8 cfu 


230 ng LacZ Plasmid c 


3 x 10 10 molecules 


2 x 10 8 cfu 





a The actual amount of Hirt used for transformation was 3/20 the entire Hirt DNA. 
The numbers have been adjusted to reflect viral innoculum and yields for the entire 
muscle. 



b Plasmid DNA was spiked into mock infected muscle homogenates prior to 
isolation of Hirt DNA. This reconstituted Hirt DNA was then used for 
transformation of bacteria. 

0 The actual microgram amounts of plasmid used in reconstitution experiments was 
10 ng. The numbers have been adjusted for comparison to normalize the number of 
plasmids genomes to that used in AAV experiments. Control LacZ plasmid was 
approximately 7000 bp with a molecular weight of 4.6 x 10 6 g/mole. 

d The average of several experiments indicates an approximate 100-fold reduction 
in the number of cfu recovered from bacterial transformations with DNA isolated 
from Hirt extract spiked with plasmids as compared to transformation with an 
equivalent amount of plasmid DNA alone. 

e Adjusted yield indicate approximately 1 in 400 AAV genomes circularize in vivo. 

Given the fact that not all rAAV particles likely contain functional DNA 
molecules and intermediates may integrate, these calculations may represent an 
underestimation. 
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AAV Circular Intermediates Demonstrate Increased Persistence as Plasmid Based 
Vectors. 

Based on the finding that circular AAV intermediates were associated with 
long term persistence of transgene expression in muscle, rAAV circular liead-to-tail 
intermediates may be molecular structures of the AAV genome associated with the 
latent life cycle and increased episomal stability. Several aspects of the structure of 
AAV circular intermediates may account for their increased stability in vivo. First, 
circularization of AAV genomes may create a nuclease resistant conformation. 
Secondly, since the only viral sequences contained within circular intermediates are 
the head-to-tail ITR array, these sequences might bind cellular factors capable of 
stabilizing these structures in vivo. Several studies have demonstrated increased 
persistence of transgene expression with plasmid DNA encoding viral ITRs (Philip 
et al., 1994; Vieweg et al., 1995). The results described above provide a functional 
explanation for the increased persistence through the association with circular 
intermediate formation as part of the AAV life cycle. 

To more closely evaluate the persistence of AAV head- to-tail circular 
intermediates, several in vitro experiments were performed by transfecting these 
intermediates into Hela cells and assessing the stability of plasmid DNA and 
transgene expression by GFP clonal expansion. Results from Hela cell transfection 
experiments demonstrated that two monomer head-to-tail circular intermediates 
(p81 and p87) studied gave rise to a 10-fold higher number of five and ten day 
transgene-expressing clones, as compared to a control pCMVGFP plasmid lacking 
the ITR sequences (Figures 7A and B). Additionally, the size of GFP positive 
colonies at 5 days post-transfection was three-fold larger in Hela cells transfected 
with p81 and p87, as compared to the pCMVGFP control vector (Figures 7A and 
B). These studies suggest the AAV circular intermediates have increased stability 
of transgene expression and substantiate findings in muscle. 

To confirm the increased molecular persistence of head-to-tail circular 
intermediates following transfection into Hela cells, total DNA (low and high 
molecular weight) was isolated from cultures of pCMVGFP and p81 transfected 
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Hela cells at various passages post-transfection and analyzed by Southern blotting. 
Southern blots hybridized to 32 P-labeled GFP probes demonstrated a significantly 
higher level of p81 plasmid DNA at passage-7 as compared to the control vector 
lacking the head-to-tail ITR sequence (Figure 7C). The majority of signal in 
undigested DNA samples was associated with a 4.7 kb band migrating at the 
approximate size of the uncut monomer plasmids. Together with the fact that the 
majority of signal from all cell cultures in Figure 7C disappeared by passage- 10, 
these data suggest that these plasmids predominantly remained episomal. Thus, in 
both muscle and Hela cells, increased persistence of AAV circular intermediates is 
correlated with stable transgene expression. 
ITR arrays are responsible for increased persistence. 

To investigate whether the head-to-tail ITR DNA element was responsible 
for the increased persistence of circular intermediates, we cloned this DNA element 
into a secondary luciferase vector (pGL3) to give rise to pGL3(ITR). Transient 
transfection experiments in Hela cells demonstrated a five-fold increase in the 
persistence of luciferase expression in serially-passaged cultures at 10 days in 
pGL3(ITR) as compared to that of pGL3 transfected (Figure 7D). These findings 
support the hypothesis that the head-to-tail ITR DNA element contained within 
circular intermediates is responsible for mediating the increased persistence of 
transgene expression and suggest a mechanism by which these molecular 
intermediates may confer stability to AAV genomes in vivo. Furthermore, increases 
in the stability of transgene expression conferred by this element appear to be 
primarily context independent, since the head-to-tail ITR element was 3' to the 
luciferase gene in pGL3(ITR) and 5' to the GFP transgene in AAV circular 
intermediates. 
Discussion 

Characterization of integrated proviral structures in different cell lines has 
demonstrated head-to-tail genomes as the predominant structural forms for both 
wild type and recombinant AAV (McLaughlin et al., 1988; Cheung et al., 1980; 
Duan et al., 1997). This is in contrast to the head-to-head and tail-to-tail structures 
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observed in AAV replication intermediates (Rfm and Rfd). Both Rfin and Rfd 
configurations have also been demonstrated in rAAV infected cells and enhanced 
conversion of ssAAV genomes to double stranded Rfm and Rfd forms has been 
suggested as a mechanism for augmentation of rAAV transduction by adenovirus in 
5 cell lines (Ferrari et al., 1996; Fisher et al., 1996). However, it is plausible that the 
mechanisms responsible for the formation of Rfm and Rfd molecules are different 
from pathways which lead to long-term transgene expression. In support of this 
hypothesis is a recent study evaluating augmentation of rAAV transgene expression 
by adenovirus in liver (Snyder et al., 1997). These studies have demonstrated that 
10 co-infection of the liver with adenovirus and rAAV enhances short term transgene 
expression while long term expression was no different than rAAV alone. The exact 

— mechanism for the formation of head-to-tail circular intermediates is not clear, 

B 

yp however similar structures have been demonstrated to act as pre-integration 

jjg intermediates for retrovirus (Varmus, 1982). In this regard, circularized retroviral 

J 15 genomes with one and two viral LTRs have been proposed. In addition, circular 

yl pre-integration intermediates have also been suggested by recent studies on wtAAV 

T" integration (Linden et al., 1996b). The demonstration that circular intermediates 

*Z exist in rAAV infected muscle explains several features of latent phase infection 

D with rAAV vectors including proviral structure and stable episomal persistence. 

20 Previous studies have suggested that rAAV genomes delivered to muscle 

might persist as head-to-tail concatamers (Herzog et al., 1997; Clark et al., 1997; 
Fisher et al., 1997). However, it is currently unknown whether these concatamers 
exist as free episomes or as integrated proviruses in the host genome. The results 
described above, i.e., demonstrating prolonged persistence of head-to-tail circular 
25 intermediates at 80 days post-infection, suggest that a large percentage of rAAV 
genomes may remain episomal. The conversion of monomer circularized genomes 
to larger circularized multimers appears to be an aspect associated with long term 
persistence and likely represents recombinational events between monomer 
intermediates. Although the bacterial rescue strategy was not capable of 
30 satisfactorily addressing the size of multimers, our modified approach to size 
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fractionating Hirt DNA prior to bacterial rescue of intermediates lends support to 
this hypothesis. Additional supportive evidence for increased recombination over 
time is the finding that greater variability in the length of ITR arrays was observed 
at longer time points post-infection. For example, at 5-22 days the majority of 
5 circular intermediates contained 2 ITRs in a head-to-tail fashion. This is in contrast 
to 80 day time points where the lengths of ITR arrays ranges from 1-3 ITRs. Such 
diversity of ITR arrays in muscle infected with AAV has been previously found 
using PCR approaches (Herzog et al., 1997; Fisher et al., 1997). In addition, the 
30% decline in the abundance of circular intermediates in muscle between 22 and 80 
10 days also supports a hypothesis that these molecular forms of AAV may represent 
pre-integration complexes. 
^ Given the fact that circular intermediates had long term persistence in 

yB muscle, certain structural features of these intermediates may affect episomal 

a— 
y 3 

03 stability of DNA. Previous studies have noted increased persistence of transgene 
^ 15 expression from plasmids encoding AAV ITRs (Philip et al., 1994; Vieweg et al., 

Ul 1995). However, the physiologic significance of this finding has remained elusive. 

7" The present study, demonstrating the head-to-tail ITR arrays isolated from AAV 

^ circular intermediates can confer increased episomal persistence to plasmids 

P 

B following transfection in cell lines, gives a mechanistic framework for ITR effects 

q 20 on plasmid persistence. Furthermore, the correlation that AAV circular 

P intermediates have increased persistence in cell lines in vitro, lends support to the 

hypothesis that these structures represent stable episomal forms following rAAV 
transduction in muscle. Stability of circular intermediates in vivo might be mediated 
by the binding of cellular factors to "Holliday-like" junctions in ITR arrays which 
25 stabilize or protect DNA from degradation. 

rAAV has been shown to be an efficient vector for expressing transgenes in 
various tissues in addition to muscle, such as brain, retina, liver, lung, and 
hematopoetic cells (Snyder et al., 1997; Muzyczka, 1992; Kaplitt et al., 1994; Walsh 
et al., 1994; Halbert et al., 1997; Koeberl et al., 1997; Conrad et al., 1996; Bennett et 
30 al., 1997; Flannery et al., 1997). Despite these advances in the application of rAAV, 

67 



the mechanisms of in vivo rAAV-mediated transduction and persistence of 
transgene expression still remain unclear. Such questions as to the molecular state 
of rAAV following in vivo delivery is highly relevant to the clinical application of 
this viral vector. For example, should rAAV primarily persist as an randomly 
integrated pro virus, the potential for insertional mutagenesis could present a major 
theoretical obstacle in the use of this vector due to the potential for mutational 
oncogenesis. The demonstration that rAAV can persist as episomes suggest that 
random integration and associated risks of malignancy may not be a major concern 
for this viral vector system. Additionally, the molecular determinants of AAV 
circular intermediates associated with increased persistence in cell lines appear to be 
contained within the DNA elements encompassing the inverted ITRs. The isolation 
of this naturally occurring viral DNA element, which forms as part of the AAV life 
cycle and acts to stabilize circular episomal DNA, may prove useful in increasing 
the efficacy of both viral and non- viral gene therapy vectors. 



Example 3 

Evidence for Increased Episomal Persistence of AAV Circular Intermediates in a 
Model for in utero Plasmid-Based Gene Therapy 

Persistence of AAV circular intermediates were assessed by injection of 
plasmid DNA directly into the pronucleus of fertilized Xenopus oocytes. Twenty- 
five ng of the p81 isolate of AAV circular intermediates was injected at the single 
cell stage of fertilized Xenopus oocytes. This plasmid was compared to the pro viral 
plasmid pCisAV.GFP3ori, which contains two ITRs separated by stuffer sequence 
in an alternative confirmation to ITRs in p81. Figure 13 depicts the persistence of 
GFP plasmids as assessed by direct fluorescence of GFP. At this state of tadpole 
development, the fertilized oocyte has expanded from a single cell to approximately 
10 6 cells. 

These studies confirm that AAV circular intermediates (p81) confer a higher 
level of stability in development Xenopus oocytes than plasmids containing similar 
transcriptional elements and ITR sequences in an alternative confirmation. Given 



68 



that in the case of p81 injected oocytes, tadpoles are completely fluorescent, the data 
suggests that some level of integration may have occurred. 

Example 4 

Liposome Mediated Transfer of Vectors of the Invention to the Airway and Muscle 

Studies evaluating the mechanisms of recombinant adeno-associated virus 
(AAV) transduction have identified a novel molecular intermediate responsible for 
episomal persistence. This intermediate is characterized by a circularized AAV 
genome with head-to-tail ITR repeats. Circular intermediates of rAAV were 
identified using a recombinant shuttle vector capable of propagating circularized 
viral genomes in bacteria. Pivotal experiments in cell lines demonstrate that the 
formation and persistence of these circular intermediates are augmented in the 
presence of helper adenovirus. These findings suggest that cellular factors induced 
by adenoviral gene expression may modulate both the formation and/or persistence 
of AAV circular intermediates. Furthermore, studies in muscle have demonstrated 
that following rAAV infection, the formation and persistence of AAV circular 
intermediates correlates with the onset and maintenance (at 80 days) of transgene 
expression, respectively. Moreover, a 300 bp fragment encompassing the head-to- 
tail inverted ITR repeats found in AAV circular intermediates when cloned into 
heterologous expression plasmids can confer increased stability to those plasmids in 
HeLa cells. The structural aspects of AAV circular intermediates may lead to 
development of non- viral, plasmid based, gene transfer vectors with increased 
persistence of transgene expression. 

To determine whether AAV circular intermediates which differ in length 
and/or sequence of the ITR array are more efficacious plasmid based vectors for 
liposome-mediated gene transfer to the airway and muscle, several distinct forms of 
AAV circular intermediates are evaluated as plasmid-based delivery systems in 
three model systems of the airway including: 1) in vitro polarized primary airway 
epithelial monolayers, 2) mouse lung, and 3) human bronchial xenografts. 
Persistence is evaluated at both the level of transgene expression (using GFP and 
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luciferase reporters) and at the level of episomal and integrated transgene derived 
DNA. Studies are performed to assess whether integration can be specifically 
enhanced by co-transfection with Rep DNA or mRNA. These studies also evaluate 
both the extent of integration and site specificity to AVS1 sites in chromosome 19 
of human model systems. 

Gene therapy using plasmid-based delivery systems have encountered 
several obstacles to efficient transgene expression. These obstacles include transient 
expression of transgenes and rapid degradation of DNA. In contrast, viruses have 
developed efficient mechanisms for transducing cells and expressing encoded viral 
genes. The molecular characteristics of AAV circular intermediates which confer 
increased persistence of transgene expression include a DNA element encompassing 
the head-to-tail ITR. Based on the findings that circular intermediates have 
increased episomal persistence in muscle following rAAV transduction, these 
structures may also have increased persistence as plasmid-based vehicles to the 
airway. Interestingly, several naturally occurring mutations which are found in 
approximately 50% of AAV circular intermediates affect the stability of the 
intermediate. 

Several findings evaluating the efficiency of AAV circular intermediate 
formation from recombinant viral vectors have suggested that these structures are 
augmented in abundance by the presence of the E2a adenoviral gene product. These 
molecular structures may represent preintegration intermediates which, in the case 
of wild-type AAV, would efficiently integrate into the cellular genome by Rep 
facilitated mechanisms. However, in the case of recombinant AAV genomes (in the 
absence of Rep proteins), evidence suggests that these structures have increased 
episomal stability. To test whether exogenous addition of Rep and/or E2a can 
increase the efficacy of AAV circular intermediates by modulating their stability 
and/or integration, co-transfection methods with Rep encoding plasmids and mRNA 
are conducted. Additionally, exogenously supplied E2a DNA binding protein 
(DBP) may also enhance stability of AAV circular intermediates. Rep may increase 
the integration of circular intermediates while E2a may increase their episomal 
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stability. Several observations including the association of E2a DBP with AAV 
genomes in the nucleus support a direct interaction between DBP and AAV circular 
intermediates. Furthermore, if DBP associates with AAV circular intermediates, its 
encoded nuclear localization sequence (NLS) may enhance nuclear sequestration of 
5 these plasmids in the nucleus. Alternatively, E2a may act to alter the persistence of 
AAV circular intermediates through the induction of cellular factors which interact 
with the ITR array. 

Liposome mediated gene transfer to the airway has considerable advantages 
due to the low level of toxicity. However, limitations include transient low level 
10 expression in differentiated airway epithelia. Despite this apparent limitation, 
several laboratories have had considerable success with the use of cationic 
liposome-mediated gene transfer in several animal models including mouse and rat 
nip lung, and numerous laboratories have pursued clinical trials, which suggested that 

these vehicles may show promise for gene therapy of the cystic fibrosis (CF) lung. 



pi 



4* 15 Thus, delivery of the present vectors in plasmid form via liposomes may be a safe 

y i 

and effective vehicle for gene transfer to the airway. 
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To assess whether AAV circular intermediates may also have increased 
persistence in airway epithelial cells as seen in Hela cells, several distinct forms of 
circular intermediates delivered by liposome-mediated transfection into primary 

20 airway epithelial cells, are evaluated. Based on the diversity of ITR repeat elements 
between various isolated circular intermediates (i.e., including 0, 1,2, and 3 ITRs), 
circular intermediates isolated from later time points in muscle may have been 
naturally selected for increased stability in vivo. Hence, the structural consistencies 
between AAV circular intermediates are identified which give increased persistence 

25 as plasmid based vectors for gene transfer. 

Circular intermediates containing the GFP reporter gene and 1, 2, and 3 ITRs 
are transfected into primary airway cultures and polarized epithelial cell monolayers 
using the cationic lipid GL-67 (Genzyme Inc.). DNA to lipid ratios are optimized 
using a luciferase reporter. Additionally, the addition of EGTA, or the use of 

30 calcium-free media, can increase the extent of gene transfer about 10-fold, and may 
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be included to enhance gene transfer to polarized epithelial monolayers. To evaluate 
persistence and expression of transgenes from circular intermediates, direct 
fluorescent microscopy and Southern blotting of both Hirt and genomic DNA with 
GFP P 32 -labeled probes are utilized. Proliferating cultures of primary airway 
epithelial cells can be passaged up to 4 times during this analysis. In contrast, 
polarized epithelial monolayers are evaluated at 1 week intervals for DNA 
persistence for up to 6 weeks. Since GFP transgene expression may be low and 
difficult to detect by direct fluorescence, GFP is quantitated by fluorometer of cell 
lysates. 

Following AAV transduction, circular intermediates may form within cells 
and certain structures of these intermediates may persist by virtue of affinity for 
cellular factors which bind at ITR arrays. If this is true, then it may be possible to 
select for and isolate optimal circular intermediates with increased persistence in 
airway cells by batch screening of circular intermediates pools from rAAV infected 
airway epithelia. 

Primary airway epithelia cell cultures are infected with AV. GFP3ori (MOIs 
of 1000 to 10,000 DNA part/cell) and low molecular Hirt DNA is prepared at 5-15 
days post-infection. Hirt DNA containing circular intermediates from rAAV 
infected cells is used to then transfect primary airway epithelial cells from which 
Hirt DNA is prepared at 5-15 days post-transfection. This second Hirt isolation is 
then used to isolate replication competent plasmids following transformation into 
bacteria. This selection process may give rise to those populations of circular 
intermediates with increased episomal persistence in airway epithelial cells. 
Selected clones of circular intermediate plasmids isolated by this procedure are then 
tested individually for increased persistence following liposome mediated 
transfection. These studies are performed in a batch type screening in 24 well plates 
using two serial passages for persistence. Once plasmids having increased 
persistence are isolated, their structure and sequence of ITR arrays are characterized. 
Since screening is performed on small-scale cultures, it may be necessary to 
implement semi-quantitative screening for DNA persistence within the first round of 
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transfection using PCR methods. Candidate plasmids with a high level of increased 
persistence as compared to control plasmids which lack ITR sequences but contain 
the identical promoter-reporter element, are evaluated on a larger scale transfection 
amenable to analysis by Southern blotting of total DNA. 

To evaluate selected circular intermediate structures in vivo, two models 
including mouse lung and the human bronchial xenograft are employed. 10 wk 
BalbC mice are transfected with GL-67/DNA complexes at a ratio of 25 jig 
plasmid/25 |j.g lipid in an iso-osmotic solution of Dextrose. At 1, 5, 10, 15, and 
20 days post-transfection lungs of mice are harvested for immunofluorescent 
detection of GFP in formalin fixed sections and for quantitative fluorometry of 
tissue lysates. Southern blots are employed to evaluate the persistence of plasmids 
in Hirt and genomic DNA. In addition to evaluating the persistence of selected 
circular intermediates which have the highest level of persistence with in vitro 
models, luciferase constructs are evaluated in which the ITR array has been cloned 
either 5' or 3' to the reporter gene. Furthermore, the use of luciferase reporters 
allows for more sensitive assessment of transgene activity in cell lysates. 

Similarly, in vivo persistence of transfected circular intermediates and 
heterologous plasmids containing ITR arrays found within circular intermediates is 
evaluated in human bronchial xenografts. 

Findings evaluating the effects of adenoviral co-infection on circular 
intermediate formation and persistence have suggested that E2a DBP leads to a 10- 
fold increase in the abundance of circular intermediates as compared to E2 deleted 
virus. Furthermore, studies with El -deleted virus have demonstrated that the 
persistence of circular intermediates in Hela cells is increased at 72 hours post- 
infection. These studies suggest that E2a DBP may augment circular intermediate 
formation and/or increase the stability of these structures by an unknown 
mechanism. E2a DBP may interact directly with circularized genomes and/or 
induce cellular factors which interact with sequences in these AAV genomes. Since 
DBP encodes an NLS, this protein may act to shuttle circular intermediates to 
regions of nucleus that allow for increased stability of these structures. NLS 
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sequences have been shown to cooperatively interact with nucleolar targeting 
sequences and hence we will also evaluate if subnuclear targeting is important in 
maintaining the increased stability of circular intermediates containing ITR arrays. 
Furthermore, it is currently unknown where circular intermediates form in the cell 
and it remains plausible that they may form in the cytoplasm or nucleus. Hence, if 
DBP associates directly with circular intermediates, it may act as an NLS for DNA 
to enter the nucleus as well. 

Several in vitro reconstitution models are used to investigate the interaction 
of circular intermediates with DBP and their affect on in vivo persistence following 
DNA transfection in Hela cells. Furthermore, results evaluating the affects of 
various mutant adenoviral vectors on circular intermediate and Rfm/Rfd formation 
have suggested that these two types of intermediates occur by independent pathways 
indicative of latent and lytic infection, respectively. In the setting of wild type 
AAV, circular intermediates may be pre-integration complexes, which in the 
presence of Rep, efficiently integrate into the host genome. In contrast, in the 
absence of Rep, circular intermediates may accumulate episomally in rAAV infected 
cells. To this end, methods of supplementing Rep function may be capable of 
enhancing integration of plasmid based delivery of AAV circular intermediates. 
Thus, experiments in which co-transfection of circular intermediate plasmids with 
Rep expression plasmids or mRNA are conducted. 

To investigate whether DBP can augment the stability of circular 
intermediates by increasing targeting to the nucleus, a Hela cell line (gmDBP6) is 
utilized which encodes an inducible E2a gene under a dexamethasone responsible 
element. This cell line gives rise to high levels of DBP in nuclear extracts by 
Western blot following treatment with dexamethasone. gmDBP6 cells (+/- DEX) 
are transfected with various AAV circular intermediate plasmids containing 0, 1,2, 
and 3 ITRs and total cellular and nuclear plasmid content evaluated by subcellular 
fractionation using Southern blotting against GFP probes. The time course of these 
studies is initially within the range of 12 hours to 4 days post-transfection. 
Transgene expression is evaluated by fluorometry (in cell lysates), and fluorescent 
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microscopy (in viable cells), for GFP and luminescence for luciferase. Hela cells 
have demonstrated that immediate increases in transgene expression from AAV 
GFP circular intermediates as compared to control GFP plasmids occur as early as 
24 hours post-transfection. Thus, certain cellular factors may facilitate an 
immediate accumulation of circular intermediates in the nucleus. DBP may invoke 
this increase by either direct interactions with ITR sequences or by the induction of 
cellular factors. To evaluate the potential for direct interactions between DBP and 
circular intermediates, various form of ITR arrays found within circular 
intermediates are end-labeled with y-ATP 32 and evaluated for binding by 
electrophoretic mobility shift assays to nuclear extracts from gmDBP6 cells (+/- 
DEX). Supershifts, with DBP antibodies and competition experiments with cold 
ITR sequences and non-specific DNA, are used as controls for specific binding. 

In a second model system aimed at evaluating the potential of DBP for 
shuttling and/or sequestering of circular intermediates to the nucleus, microinjection 
experiments in oocytes are performed with 50 ng of plasmid DNA of circular 
intermediates with and without 50 ng of DBP mRNA. Experiments initially 
evaluate the time course of GFP transgene expression (+/- DBP cRNA) by direct 
fluorescent microscopy. If major differences are seen, quantitative fluorometry of 
individual whole oocytes in 96 well plates is conducted. Similar studies on nuclear 
targeting in the presence of DBP can also be evaluated in this model by pooling 
microinjected oocytes for nuclear isolation and Southern blot analysis. 

A third experimental model to evaluate nuclear targeting and/or 
accumulation of circular intermediate vectors in the presence and absence of DBP 
involves the microinjection of fluorescently labeled plasmid DNA into the 
cytoplasm and real time imaging to follow the nuclear accumulation of DNA. The 
DNA fluorescent dye, TOTO-1, is used to label DNA prior to injection. This dye 
forms an extremely stable complex with negligible diffusion and re-incorporation 
into nuclear DNA following transfection into polarized airway epithelial cell 
monolayers. Co-localization of DBP with wtAAV DNA genomes at focal hot spots 
within the nucleus supports the observation that nucleolar targeting may be 
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important for persistence. These experiments are also performed in primary airway 
epithelial cells and in vivo models of the airway by either co-transfection of circular 
intermediates with DBP expressing plasmids and/or mRNA. 

The effects of Rep co-transfection on the integration of circular intermediate 
plasmids is also evaluated. Two methods are used to express Rep including: 1) co- 
transfection with Rep expressing plasmids, and 2) co-transfection with Rep 
encoding mRNA. Initially, Hela, CFT1, and IB-3 cells are tested, as transformed 
cells may be more amenable to expansion and evaluation of integration. Both CFT1 
and IB-3 cells represent airway epithelial cells. Experiments are performed by 
cationic liposome (GL-67) mediated transfection of circular intermediate DNA with 
varying doses of a Rep-containing expression vector, e.g., pCMVRep. The extent of 
integration is also evaluated by two criteria, Southern blotting of Hirt and genomic 
DNA and clonal expansion of GFP expressing cells. Since Southern blot has an 
approximate limit of sensitivity of 1 integrated plasmid molecule per 10 cellular 
genomes, clonal expansion may be necessary to evaluate persistence in less 
transfectable cells such as CFT1 and IB-3 cells. Cell lines are evaluated over the 
course of 1-10 passages. 

Sustained expression of Rep by plasmid mediated co-transfection may be 
toxic to cells, hence co-transfection with Rep mRNA is also evaluated. Cationic 
liposome:mRNA mediated transfection has been previously shown to work in cell 
lines and although the level of expression is much more transient than for DNA, in 
these studies it may be an advantage. Initial studies are performed with in vitro 
transcribed Rep mRNA alone to evaluate the jig amount of mRNA needed for Rep 
expression as determined by Western blot. Once the threshold for detectable Rep 
expression is established, increasing amounts of Rep mRNA are co-transfected with 
circular intermediate DNA. Similar assays are used as described above to evaluate 
the extent of AAV circular intermediate integration. If findings suggest that 
increased integration if facilitated by Rep, the site specificity of this integration can 
be evaluated by cloning GFP expressing cells after the 10th passage by serial 
dilution. These GFP expressing clones are expanded and genomic Southern blots 



76 



assessed with both GFP and AVS1 specific probes. By evaluating a number of 
restriction enzymes which either do not cut or cut once within the circular 
intermediate plasmid, it will be determined whether integration has occurred at the 
AVS1 loci. 

To test whether secondary structure rather than primary sequence is the 
important determinant of increased episomal stability of AAV circular 
intermediates, synthetic DNA sequences are generated with identical secondary 
structure to several ITR arrays in circular intermediates. The primary sequence is 
completely altered and bares no resemblance to sequences contained within native 
AAV ITRs. These synthetic DNA sequences are tested for their ability to confer 
increased episomal stability to heterologous plasmids in several model systems 
including: 1) the airway, 2) muscle, 3) and developing Xenopus embryos. The 
developing Xenopus embryo model is ideal for testing integration and persistence of 
plasmid based vectors for application of in utero gene therapy. If synthetic DNA 
sequences with similar secondary structure to ITRs are found to confer increased 
persistence to plasmid based vectors, then determinants for protein binding which 
facilitate persistence are independent of primary base sequence. These studies allow 
the optimization of the secondary structural requirements by synthesizing a wide 
range of DNA molecules with varying degrees of palindromic repeats. Furthermore, 
the secondary structure may not bind proteins directly but facilitate recombination 
of plasmids to large concatamers which have increased episomal stability or 
enhanced integration efficiencies. 



Example 5 

Delivery of Multiple Genes through Intermolecular Concatamerization 
Methods 

Recombinant AAV vectors . 

Two rAAV vector stocks were generated for use in these studies, 
AV.GFP3ori (Example 1) and AV.Alkphos (also known as CWRAPSP, a gift of 
Dusty Miller) (Halbert et al., 1997). Virus stocks were generated by co-transfection 
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of 293 cells with either pCisAV.GFP3ori or pCWRAPSP along with pRep/Cap, 
followed by co-infection with recombinant Ad.CMVlacZ helper virus (Example 2). 
rAAV was then purified through three rounds of CsCl density gradient 
centrifugation as previously described by Duan et al. (1997). Purified viral fractions 
were heated at 60°C for 1 hour to inactivate any residual contaminating helper 
adenovirus. The yields for AV.GFP3ori and AV.Alkphos were 1 x 10 12 and 7 x 10 11 
particles per ml, respectively, as determined by slot blot hybridization with 32 P- 
labeled GFP or Alkphos probes. Infectious titers determined by infection of 293 
cells with rAAVs were 1.1 x 10 9 IU/ml (AV.GFP3ori) or 8.6 x 10 8 IU/ml 
(AV.Alkphos). Controls testing for contamination of rAAV stocks with wtAAV by 
anti-Rep immunocytochemical staining in rAAV/ Ad.CMVlacZ co-infected 293 cells 
were negative (limit of sensitivity is less than 1 infectious wtAAV particle per 10 10 
DNA particles of rAAV). Similarly, histochemical staining for P-galactosidase in 
rAAV infected 293 cells showed no detectable contamination with helper 
adenovirus in 10 10 DNA particles of rAAV (limit of sensitivity). 
Infection of muscle tissue and evaluation of transgene expression . 

The C57BL/6 mice used for these experiments were housed in a virus-free 
animal care facility and were maintained under strict University of Iowa and NIH 
guidelines, using a protocol approved by the Animal Care and Use Committee and 
facility veterinarians. Four to five week old mice received bilateral 30 |il injections 
of a mixture of both AV.GFP3ori and AV.Alkphos into the tibialis anterior muscle 
(5 x 10 9 DNA particles of each virus per muscle). Controls included uninjected 
muscles and muscles receiving injections of one of the viruses alone. At 14, 35, 80, 
and 120 days post-infection, animals were euthanized and tissues were harvested for 
evaluation of transgene expression and preparation of low molecular weight Hirt 
DNA. For each experimental time point, at least 3 independently injected muscles 
were evaluated. 

In all experiments, GFP fluorescence was visualized in freshly excised 
muscle tissue prior to processing. A portion of the same muscle was fixed with 2% 
paraformaldehyde in phosphate buffered saline, and cryoprotected in graded sucrose 
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solutions before embedding in optimal cutting temperature medium (OCT). 
Sections (6 urn) were then evaluated for GFP expression directly and Alkphos 
expression following heat inactivation of endogenous Alkphos and histochemical 
staining for Alkphos activity (Engelhardt et al., 1995). To confirm dual localization 
of GFP and Alkphos expression in the same muscle fibers, either serial sections 
were evaluated for GFP and Alkphos expression or the same section was first 
photographed for GFP expression followed by histochemical staining for Alkphos 
and re-imaging of the same field. 
Rescue of circular intermediates from muscle Hirt DNA . 

Low molecular weight Hirt DNA was prepared from 20 mg specimens of 
injected muscles from 3 animals at each time point (Example 2). Hirt DNA (4 |il; 
1/5 of the total volume) was then used to transform 50 of electrocompetent SURE 
cells (Stratagene) using a BioRad E. coli electroporater and 0.1 (im cuvettes. 
Colonies resulting from each bacterial transformation were quantified, and plasmids 
from 20 colonies from each muscle Hirt DNA sample were purified for analysis. It 
should be noted that only circular forms carrying the Amp resistance gene and the 
bacterial origin of replication from AV.GFP3ori are rescued by bacterial 
transformation (Duan et al, 1998). Control experiments reconstituting 5 x 10 10 viral 
DNA particles into uninfected muscle extracts prior to Hirt DNA preparation failed 
to give rise to replication competent plasmids in the rescue assay (Duan et al., 
1998). Additional controls in Duan et al. (1998) using AV.GFP3ori virus also 
demonstrated that linear double stranded and single stranded purified viral DNA 
genomes do not give rise to replication competent plasmids following 
transformation into E coli. 

Characterization of encoded genes in rescued circular intermediates . 

Several assays were used to characterize the extent of intermolecular 
recombination between independent circular viral genomes by evaluating the 
number and type of encoded genes in rescued plasmids from Hirt DNA of muscles 
co-infected with AV.GFP3ori and AV.Alkphos. Initial analysis involved the bulk 
evaluation of 60 rescued plasmids (20 from each of three muscle samples for each 
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time point) by dot blot hybridization of mini-prep DNA with EGFP, Alkphos, and 
Amp 32 P-labeled DNA probes. In these studies, Amp hybridization served as a 
control to show that there was a sufficient quantity of DNA for the analysis. The 
percentages of Alkphos and/or GFP hybridizing plasmids were calculated by this 
5 method for each muscle sample. From this percentage, the total number of plasmids 
hybridizing to each probe in the Hirt DNA sample was calculated from the total 
CFU obtained in each transformation. In this analysis, each muscle sample was 
evaluated independently to determine the mean (+-/-SEM) total Alkphos and/or GFP 
hybridizing plasmids. A second evaluation involved the transfection of rescued 
10 plasmids into 293 cells using lipofectamine, followed by evaluation of GFP 
fluorescence and histochemical staining for Alkphos. To confirm that GFP and 
Alkphos co-expressing plasmids were indeed clonal and that both genes were 
encoded on the same plasmid, a selected group of five co-expressing plasmids were 
retransformed into E.coli and colonies were re-isolated prior to repeating the 
15 transfection studies. In all cases, plasmids co-expressing the two reporter genes 
remained clonal through this subsequent re-isolation. 
Structural analysis of concatamer rAAV circular intermediates . 
H» To further characterize the nature of isolated circular intermediates co- 

p{ expressing both GFP and Alkphos transgenes, plasmid structure was mapped by 

EH 20 Southern blotting and restriction enzyme analysis. The structural of five co- 

expressing circular intermediate plasmids were determined by digestion with Ahdl, 
Hindlll, NotI, Hindlll/NotI, Clal/Asel, and/or SnaBI and Southern blotting was 
performed with 32 P-labeled GFP, Alkphos, and ITR probes. 
Results 

25 Strategy for characterizing mechanisms of rAAV circular intermediate formation . 
Efficient circularization of rAAV genomes has been previously 
demonstrated to occur in muscle in a time dependent fashion (Example 2). 
Furthermore, the conversion of monomelic to multimeric circular rAAV 
intermediates occurred over time and was associated with long-term episomal 
30 persistence of AAV genomes. High molecular weight AAV circular genomes might 
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form by either of the following two mechanisms, one involving the replication of 
monomer structures and the other through intermolecular recombination between 
independent monomers. A rescue assay was developed using two separate rAAV 
vectors, AV.GFP3ori and AV.Alkphos (Figure 14A), which allowed for the 
identification of independent viral genomes through unique transgenes. In this assay, 
circular form genomes were rescued in bacteria by virtue of Amp/ori sequences 
encoded in one of the two vectors (AV.GFP3ori). A method for characterizing the 
extent of intermolecular recombination between independent circular rAAV 
genomes was shown in Figure 14B. 

Co-expression of independently encoded rAAV transgenes in muscle mvofibers . 

To confirm that myofibers can be co-infected at a high efficiency with the 
two rAAV vectors, the tibialis anterior muscle of mice was co-infected with 5 x 10 9 
DNA particles of both AV.GFP3ori and AV.Alkphos. At 14, 35, 80, and 120 days 
post-infection, muscles were harvested and analyzed for transgene expression. 
Transgene expression from both reporters was weak but clearly visible in 14 day 
muscle samples. By 80 days post-infection, transgene expression was maximal and 
serial sections demonstrated expression of both Alkphos and GFP transgenes in 
overlapping regions of the muscle (Figures 15A-C). At this time point, 
approximately 50% of the fibers in the tibialis muscle expressed both transgenes. 
To confirm that co-infection of myofibers occurred with the two independent 
vectors, co-localization studies were performed on muscle sections by a serial 
staining procedure. These studies, depicted in Figure 15D, demonstrate four classes 
of myo fiber transgene expression: 1) GFP positive only, 2) Alkphos positive only, 
3) GFP/Alkphos positive, and 4) no transgene expression. The largest fraction of 
myofibers expressed both GFP and Alkphos transgenes. These results confirm that 
at the titers of virus used for infection, co-infection occurred in greater than 90% of 
transgene expressing myofibers. 

Rescue of bi-functional rAAV circular intermediates increases over time . 

To determine the extent of recombination between circular AAV genomes, 
circular form genomes were rescued as plasmids from low molecular weight Hirt 
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DNA of muscle tissue co-infected with AV.GFP3ori and AV.Alkphos. Following 
transformation of E.coli Sure cells with Hirt DNA purified from infected muscles, 
the total number of GFP and Alkphos hybridizing Amp resistant bacterial plasmids 
was quantitated for each time point post-infection (Figure 16A and B) (Duan et al., 
1995), the abundance of circular AAV genomes rescued from AV.GFP3ori 
increased over time. For each muscle sample (three for each time point) twenty 
plasmid clones were evaluated for hybridization to GFP and Alkphos DNA probes 
and the total number of plasmids was back calculated from the total CFU for each 
individual muscle sample. Figure 16B demonstrates the mean (+/-SEM, N=3) total 
plasmids that hybridized to GFP or GFP/Alkphos probes at each time point. At 14 
days post-infection, GFP/Alkphos co-hybridizing plasmids were never observed. In 
contrast, at time points after 35 days the percentage of GFP/Alkphos co-hybridizing 
plasmids increased with time and reached 33% by 120 days (Figure 16C). Since 
bacterial plasmid rescue can only occur through AV.GFP3ori genomes, this data 
suggests that recombination between independent Alkphos and GFP rAAV genomes 
takes place over time. These results are consistent with studies described 
hereinabove demonstrating a time dependent concatamerization of monomer 
circular rAAV genomes in muscle. 

To evaluate the ability of circular intermediates to express encoded 
transgenes, transient transfection studies were performed in 293 cells with rescued 
circular intermediate plasmids (Figures 17A-C). Between 85-90% of rescued 
plasmids hybridizing to GFP probes on slot blots also expressed the GFP transgene 
in this transfection assay (Figure 17D). The percentage of GFP expressing plasmids 
that also expressed Alkphos rose over time in concordance with the hybridization 
data (Figure 17D). However, approximately 40-50% of plasmids which were 
hybridization positive for Alkphos did not express the Alkphos transgene. This may 
represent recombinational deletion of the RSV promoter driving Alkphos expression 
which occurred during concatamerization at sites near the 5' ITR. These results 
demonstrate that intermolecular recombination between Alkphos and GFP derived 
circular intermediates occurs as part of the time dependent concatamerization 
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process of rAAV in muscle. To confirm that amplified plasmids stocks expressing 
both reporter genes were actually clonal (i.e., one plasmid rather than two 
independent plasmids resulting from contamination), a select number of bacterial 
clones expressing both transgenes were re-isolated and the transfection assays were 
repeated. In all cases, plasmids expressing the two reporter genes remained clonal 
through two rounds of bacterial cloning. Hence, dual reporter expression was not 
due to contamination of independent GFP and Alkphos expressing plasmids. 
Concatamerization of AAV circular intermediates occurs through uniform 
intermolecular recombination between ITRs of independent viral genomes . 

To better understand the mechanisms of circular concatamer formation, a 
detailed structural analysis was performed of five bi-fiinctional circular concatamers 
isolated from rAAV infected muscle samples. As previously described for the 
AV.GFP3ori genome (Example 2), the conversion of monomelic circular AAV 
genomes to large multimeric circular concatamers with a predominant head-to-tail 
structure increased with time in muscle. To evaluate the structure of bi-fiinctional 
circular concatamers, restriction enzyme mapping and Southern blot analysis using 
32 P-labeled EGFP, Alkphos, and ITR probes was employed. Results from five 
analyzed plasmids demonstrated between 3-6 genomes within these circular 
concatamers. Two representative structures from 35 and 80 day time points are 
shown in Figure 18. Several interesting conclusions can be made from this 
structural analysis. As described , head-to-tail oriented genomes could be seen in all 
isolated concatamers. However, several examples of head- to-head and tail-to-tail 
genome combinations of AV. Alkphos and AV.GFP3ori were also seen. Since head- 
to-head and tail-to-tail genome concatamers were never seen in muscles infected 
with AV.GFP3ori alone, there must be a selective disadvantage for bacterial 
replication when ori sequences are in either of these conformations. However, since 
the AV. Alkphos genomes do not contain a bacterial origin of replication, this 
orientation is permitted in chimeric concatamers. Second, noticeable deletions 
and/or loss of restriction sites close to ITRs were noted (Figure 17). It is not known 
whether deletions close to the ITR are a common event in the concatamerization 
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process, but if so, this could account for the fact that only 60% of GFP/Alkphos 
hybridizing circular intermediates also expressed the Alkphos transgene. 
Discussion 

Concatamerization of rAAV genomes has long been recognized in integrated 
pro viral genomes. Recently, the association of this concatamerization process with 
the formation of high molecular circular genomes in muscle has suggested that this 
process may also be important in episomal persistence. The findings described 
herein demonstrated rescue of independent viral genomes within the same circular 
concatamer, suggesting that this process of concatamerization occurs through 
intermolecular recombination. Furthermore, at 14 days the predominant form of 
viral genome in muscle was circular monomers (Example 2), which correlates with 
the results described above demonstrating only GFP expression in rescued circular 
intermediates at this time point. Together with the fact that bi-functional rescued 
circular concatamers increase with time, these results suggest that large concatamers 
form by recombination of monomelic circular precursor genomes. Furthermore, 
since an alternative model of concatamerization by rolling circular replication would 
be expected to yield only GFP expressing rescued plasmids in this system, this 
mechanism does not appear responsible for concatamerization. 

Based on the structural analysis of these bi-functional circular intermediates, 
recombination between monomelic circular rAAV genomes is likely facilitated 
through ITR sequences. Directionality of this recombinational event does not 
appear to play a significant role, since head-to-tail, head-to-head, and tail-to-tail 
oriented intermolecular concatamers were found. In addition, the extent to which 
recombination within ITR repeat regions occurs in bacteria is presently unknown 
and may account for the deletions and/or restriction site losses near ITR arrays. 
However, serial passaging of bi-functional circular AAV genomes in bacteria has 
suggested that the structure of these large concatamers is impressively stable in 
bacteria. 

Intermolecular recombination of rAAV genomes to form single circular 
episomes may be particularly useful for gene therapy. For example, large regulatory 
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elements and genes beyond the packaging capacity of rAAV may become linked 
after co-infecting tissue with two independent vectors (Figure 19). This strategy 
could also involve trans-splicing vectors encoding two independent regions of a 
gene which are brought together to form an intact splicing unit by circular 
concatamerization. 

Example 6 

Enhancement of Recombinant AAV Mediated Gene Expression 
Through Intermolecular C/s-activation 

Materials and Methods 
Recombinant AAV vectors . 

The pcisAV.Luc pro viral plasmid was generated by cloning the 1983 bp 
Nhel/BamHI fragment from pGL3-Basic (Promega), containing the luciferase gene 
and S V40 late polyA signal, by blunt-end ligation into the blunted Xba I site of 
pSub201 (Samulski et al., 1987). Similarly, pcisAV.SV(P)Luc was generated using 
a blunted 2175 bp Nhel/BamHI fragment, from the pGL3-Promoter (Promega), 
containing the SV40 promoter, luciferase gene, and SV40 late polyA signal. The 
pcisAV.SV(P/E)Luc plasmid was generated by blunt-end ligation of a 2427 bp 
Nhel/Sall fragment from pGL3-Control (Promega) into the blunted Xba I site of 
psub201. This construct contains the SV40 promoter, luciferase gene, SV40 late 
polyA signal and SV40 enhancer. 

The "super-enhancer" vector, pcisAV.SupEnh, was produced using a two- 
step cloning process. First, a 0.62 kb blunted Bglll/Pvul fragment containing the 
CMV immediate early enhancer from pIRES (Clontech) was subcloned into the 
blunted BamHI site in pGL3 -Control (Promega) to make pGL3-Control-CMVenh. 
Then a 0.92 kb DNA segment containing both the CMV immediate early enhancer 
and the SV40 enhancer was released by Clal/Sall double digestion of pGL3- 
Control-CMVenh and subsequently inserted into the blunted PstI site of 
pcisAV.GFP3ori (Duan et al., 1998b). The resulting pcisAV.SupEnh plasmid 
contains the SV40 enhancer, the CMV immediate early enhancer, the P-lactamase 
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gene, and a bacterial replication origin. The ampicillin resistance gene (0- 
lactamase) and bacterial original of replication were included in pcisAV.SupEnh to 
facilitate the subsequent rescue of circular AAV genomes from infected cells in 
bacteria. 

The control vector, pcisAV. AmpOri, was generated by blunt-end ligation of 
a 1 . 1 kb Sail digested stuffer sequence from the human glycosylasparaginase cDNA 
into PstI digested pcisAV.GFP3ori. This plasmid has a structure similar to that of 
pcisAV.SupEnh, except that it does not contain any enhancer elements. The 
pcisAV.AmpOri was used as a negative control for non-specific enhancement of 
transgene expression by intermolecular recombination of two different AAV 
vectors. 

The integrity of the ITR sequences in all the plasmids was confirmed by 
digestion with restriction enzymes, including Smal, MscI, and BssHII, which have 
unique cutting sites within different regions of ITR. All the viral preparations were 
obtained according to a method described in Duan et al. (1997). The quality of the 
viral stocks (i.e., contamination with adenovirus and/or wild type AAV) was 
confirmed as previously described in Duan et al. (1998b). The analyses showed less 
than 1 recombinant adenovirus and wt AAV infectious particles per 10 10 particles of 
rAAV. Viral titers were determined by quantitative slot-blot hybridization using 
either luciferase, CMV enhancer, SV40 enhancer, or ori probes for each of the 
respective vectors against plasmid copy number standards. 
Luciferase enzyme assays . 

Luciferase assays were performed from cell lysates harvested from either in 
v/>0-infected human fibroblasts or from in v/vo-infected mouse tibialis anterior 
muscle. Human fibroblasts were infected with virus in 60 mm dishes. These in 
vz/ro-infected cells were harvested at 3 days post-infection by rinsing cells with PBS 
twice, and then incubating with lx Report lysis buffer (Promega) (400 \i\ per 60 mm 
plate) at room temperature for 15 minutes. Cells were scraped into an eppendorf 
tube and centrifuged for 30 seconds at 14,000 rpm. Serial dilutions of supernatants 
were assayed for luciferase activity using reagents from Promega according to the 
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manufacturer's protocols. Luciferase activity was detected in triplicate for each 
individual sample with a Luminometer (TD-20/20, Turner Designs Instrument, 
Sunnyvale, CA), at the setting of 90.3% sensitivity, 1 second delay, 10 second 
measurement. Six independent samples were assayed for each experimental 
condition. 

For in vivo assay of luciferase activity, the anterior tibialis muscle of 8 week 
old C57BL/6 mice was infected with an indicated amount and type of AAV vectors 
(see Figure 23) in 30 jil phosphate-buffered saline (PBS). The entire muscle was 
harvested at 30 days or 90 days post-infection and weighed prior to cell lysate 
preparation. The muscle tissue was frozen in liquid nitrogen and pulverized by hand 
grinding with an ice-cold porcelain mortar and pestle. The muscle was further 
minced and homogenized in 100 jil of lx Report lysis buffer with a hand-held 
plastic pestle for 2 minutes (Kontes, Vineland, New Jersey). After 15 minutes 
incubation at room temperature, the crude lysates were spun for 30 seconds at 
14,000 rpm, and the supernatants were used for luciferase activity assay as described 
above. To minimize variability, all experimental samples were analyzed 
simultaneously using the same batch of luciferase assay reagents and were 
normalized to the protein content in the lysate. 
Results 

Co-administration of a czs-activating vector increases rAAV mediated luciferase 
expression in fibroblasts . 

To test the hypothesis that ds-activation from two independent AAV vectors 
can occur following intermolecular concatamerization, several rAAV vectors with 
defined regulator elements and/or the luciferase reporter gene were constructed 
(Figure 20). One of these vectors, AV.Luc, contains the luciferase transgene and an 
SV40 poly A signal but no promoter sequences. AV.SV(P)Luc is similar to AV.Luc 
except that an SV40 promoter (lacking the enhancer sequence) was inserted in front 
of the luciferase transgene. AV.SV(P/E)Luc includes both the SV40 promoter and 
enhancer, driving expression of the luciferase transgene, and was used as a control 
for maximal expression in the absence of intermolecular recombination with an 
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enhancer containing vector. To evaluate intermolecular m-activation by enhancer 
elements, an rAAV "super-enhancer" vector (AV.SupEnh) was also constructed, 
which contains S V40 and CMV enhancer regions without promoter or transgene 
sequences. A negative control vector (AV.AmpOri), which is similar to 
5 AV.SupEnh except that the enhancer sequences were replaced by a non-specific 
stuffer fragment, was also constructed. 

Enhancers are c/s-acting DNA sequences that can be recognized by 
regulatory proteins to stimulate transcription in a context independent manner 
relative to the promoter and transgene. If cells were co-infected with AV.SV(P)Luc 
10 and AV.SupEnh vectors, luciferase transgene expression could be significantly 
increased from the minimal S V40 promoter only if intermolecular recombination 
had occurred between the two independent vectors (Figure 21). However, in 
accordance with the definition of an enhancer, no activation should occur if the 
enhancer sequences and the transgene cassette are located in separate circular DNA 

m 

ir 15 molecules (Lewin, 1997). 

Hi Initial experiments were performed by infecting 1 x 10 6 primary human 

y i 

«F fibroblasts with single vectors [AV.Luc or AV.SV(P)Luc] at a multiplicity of 

iu infection (moi) equal to 1000 viral particles/cell. Additional experimental points 

W included co-infection of AV.Luc or AV.SV(P)Luc with either AV.SupEnh or 

Ql 20 AV.AmpOri at the same moi. 
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£ As shown in Figure 22, infection of fibroblasts with both the promoter-less 

[AV.Luc] and the minimal promoter [AV.SV(P)Luc] luciferase constructs alone 
gave only minimal expression at 3 days post-infection. However, co-infection with 
AV.SupEnh produced 16- and 35-fold inductions in luciferase expression from the 
25 AV.Luc and AV.SV(P)Luc vectors, respectively. Thus, c/s-activation of a minimal 
SV40 promoter can occur in the presence of a second AAV vector containing 
enhancer elements. Unanticipated, however, was the high level of transactivation of 
the AV.Luc construct, which contains no heterologous promoter sequences. These 
findings support earlier studies suggesting that ITRs contain a cryptic promoter 
30 (Flotte et al., 1993). The specificity of this induction was further demonstrated by 
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the lack of transactivation following co-administration of an alternative control 
vector, AV.AmpOri, which does not contain enhancer elements. 

To confirm that the transactivation observed was due to recombination of 
two independent AAV virus, Hirt DNA from infected cells was transformed into 
competent SURE E. Coli cells. As expected, no bacterial clones were retrieved 
from cells infected with either AV.Luc or AV.SV(P)Luc alone (neither vector 
contains amp r and ori sequences). However, in cells co-infected with AV.SupEnh 
(which contains amp r and ori sequences but no luciferase gene), approximately 4% 
of the rescued clones also contained the luciferase transgene, according to restriction 
enzyme mapping and Southern blotting analyses. Subsequent transfection of Hela 
cells with these rescued, circular concatamer plasmids demonstrated 103+/-6 fold 
higher luciferase activity from AV.Luc/ AV.SupEnh, as compared to 
AV.Luc/ AV.AmpOri recombined AAV genome plasmids. Taken together, these 
results indicated that intermolecular concatamerization of a reporter rAAV virus 
with an independent "super-enhancer" AAV vector substantially increased the 
efficiency of transgene expression in cultured human primary fibroblasts. 
Intermolecular cis-activation enhances AAV mediated luciferase expression in 
muscle tissue in vivo . 

To confirm whether the in vitro findings could also be applied to increase 
rAAV mediated gene expression in vivo, 2 x 10 10 particles of AV.Luc or 
AV.SV(P)Luc were injected into the tibialis anterior muscle of C57BL/6 mice, 
either individually or in combination with AV.SupEnh (2 x 10 10 particles). The 
infected muscles were harvested at 30 and 90 days post-infection. Consistent with 
previous findings (Example 5), intermolecular recombination of circular rAAV 
genomes increased from about 5% at the 30 day time point to 25% by 90 days post- 
infection. Importantly, co-administration of AV.SupEnh vector with either AV.Luc 
or AV.SV(P)Luc (Figure 23A) led to a functional enhancement of transgene gene 
expression, with a 30-fold increase observed at 30 days post-infection. By 90 days 
post-infection, greater than 200-fold [AV.Luc] and 600-fold [AV.SV(P)Luc] 
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increases in transgene expression were observed when AV.SupEnh was co- 
administered. 

Consistent with in vitro experiments in fibroblasts, the bacterially rescued 
concatamers containing both the luciferase transgene and AV.SupEnh also 
demonstrated greater than a 100-fold higher luciferase activity than the original 
proviral luciferase plasmid (pcisAV.Luc) alone. Interestingly, 90 day muscles 
infected with 2 x 10 10 particles of the control AV.SV(P/E)Luc vector (which 
contains both the SV40 enhancer and promoter) produced luciferase levels that were 
10- and 100-fold less than the levels seen following co-infection with 
AV.SupEnh/AV.Luc and AV.SupEnh/AV.SV(P)Luc, respectively. These findings 
underscore the potential of using large multi-enhancer segments to increase AAV 
mediated gene delivery through intermolecular c/s-activation. Such applications 
will likely have broad implications for in vivo AAV mediated gene delivery to 
organs for which concatamerization of the AAV genome is an inherent process of its 
latent life cycle. 

Both in vitro and in vivo results described herein demonstrated effective 
augmentation of transgene expression following intermolecular concatamerization 
between two independent AAV vectors carrying a transgene or enhancer sequences. 
A surprising finding was the strong induction of transgene expression from an AAV 
vector lacking an endogenous promoter (AV.Luc). Previous studies have implied 
weak promoter activity in the AAV ITR sequences (Flotte et al., 1993). Hence, it is 
possible that, in combination with enhancer sequences supplied by another AAV 
vector, a therapeutic level of transgene expression could be achieved for disease 
genes approaching the maximum packaging capacity for AAV. One notable 
example would be in applications of AAV for cystic fibrosis. Additionally, a 
particular interesting extension of this work will be to use large cell-specific 
enhancer regions for targeted expression. 



90 



Example 7 

Generation of rAAV Delivery Systems for Full-length CFTR and EPO 

Based on AAV Circular Concatamer Formation 
Given the findings that circularization and concatamerization are integral 
5 parts of rAAV transduction in the airway, rAAV transduction, circularization and 
concatamerization can be used to deliver CFTR transgene cassettes to the airway. 
Two approaches may be employed in which various genetic elements of the CFTR 
transgene cassette are split into two or more vectors which are then used for co- 
infection of the airway. The first approach, "/ra^s-splicing", utilizes a rAAV vector 
10 harboring the promo ter/enhancer driving the first half of CFTR DNA flanked by a 
donor splice site (vector- 1, Figure 19C) and a second rAAV vector harboring the 
second half of CFTR DNA and a polyA addition sequence preceded by a donor 

« acceptor site (vector-2, Figure 19C). A second approach, "czs-activation", employs 

yj 

ff| a first vector that harbors the entire CFTR transgene with a minimal synthetic 

ro 

1 5 promoter and a second vector comprises several strong enhancer sequences. 
Through the process of concatamerization, these two vectors are brought into 
juxtaposition with one another, allowing for splicing or m-activation of 
enhancer/promoter combinations. 

Initial experiments assess the extent of concatamerization in epithelia and in 
20 cell lines. Two rAAV CFTR plasmid-based vectors were prepared, pAV.RVSCF77? 
donor and pAV. CFTR acceptor. The splice consensuses sequences are based on the 
large T antigen splice, sites. A synthetically generated circular chimera was also 
constructed to represent rAAV circular concatamers. The chimera was generated by 
PCR-mediated approaches in which splice site consensuses were incorporated into 
25 the primers used to amplify each half of CFTR from pBQCFTR (Drumm et al., 

1990). Several clones were sequenced in their entirety and PCR errors removed by 
subcloning the appropriate correct fragment from pBQCFTR. 

Assays for splicing integrity include RNase protection assays of transfected 
MDCK cells for sequences within the large T antigen intron and surrounding 
30 regions. Polarized airway epithelial cells grown at the air-liquid interface are co- 
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infected with the donor and acceptor CFTR AAV vectors and CFTR gene 
expression in these cells is then monitored by both immunofluorescent localization 
and functional analysis of short circuit currents (Smith et al., 1992; Smith et al., 
1990). Additionally, functional assays using two electrode voltage clamp 
measurements (TEV) of oocytes following nuclear microinjection of the chimeric 
plasmid are used to demonstrate intact splicing and active CFTR channels (Zhang et 
al, 1998). Hirt analyses of episomal AAV species are used to correlate the efficacy 
and persistence of CFTR gene expression with the formation of AAV circular 
intermediates. 

Several systems are utilized to evaluate the efficiency of intermolecular 
recombination to form functional CFTR expressing circular concatamers. The first 
employs short circuit measurements in polarized CF airway epithelia. Conditions 
for rAAV infection include basolateral infection of polarized CF epithelia, 
preferably in the presence of agents that enhance the level of transgene expression. 
A second model employs CF human bronchial xenografts to evaluate CFTR 
function complementation. Reconstitution of xenografts with UV-irradiated rAAV 
infected primary cells can resulted in approximately 50% transduction in 
differentiated xenografts. Functional expression of CFTR in CF primary airways is 
evaluated by transepithelial potential difference (PD) measurements (Jiang et al., 
1998). 

In addition, a "c/s-activation" approach may be employed (Example 6) to 
utilize the concatamerization process to deliver full-length CFTR. For example, a 
synthetic minimal promoter driving full-length CFTR within one viral vector and a 
second independent vector encoding a tandem array of strong enhancer sequences 
such as RS V, CMV, and SV40 may be employed. 

A trans-splicing approach was employed to deliver the genomic epo gene to 
mice. The vectors are shown in Figure 25. The infection of mice with these vectors 
protected the mice from adenine induced anemia brought on by renal failure. 
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Example 8 

Generation of rAAV Delivery Systems for Autonomously Replicating 
Episomes Harboring EBV Replication Origins 

To prepare autonomously replicating circular episomes, e.g., to increase the 
stability of genes delivered via rAAV vectors, a rAAV vector comprising a 
replication origin of a circular episome is employed. For example, a rAAV vector 
comprising the EBV replication origin (OriP) and EBNA-1, the only viral protein 
needed to facilitate replication at this origin, is prepared (Vector 2, Figure 24). 
DNA fragments encoding EBNA-1 and OriP are excised from pREPIO (Clontech 
Inc.) and cloned into a pCisAV.RSV vector. Initially, virus is produced from 
pCisAV.EBNA-l/OriP construct and a reporter construct, e.g., AV.GFP3ori, and 
virus is used to co-infect Hela cells. Hela cells are infected with each of the vectors 
alone (Figure 24) or together (MOI = 1000 particles/cell) and serially passaged. 
Transgene expression is quantitated following each passage and Hirt DNA is 
isolated for Southern blot analysis of episomal DNA persistence. If persistence of 
GFP expression is increased in the presence of co-infection with EBNA-1 rAAV 
vectors following serial passage, there will be an increased abundance of co- 
encoding EBNA-1 /GFP rescued plasmids with increasing passage number. 

Then primary human bronchial cells are infected with each of the vectors 
alone or together at an MOI = 10,000 particles/cell and infected cells seeded into 
human bronchial xenografts. Persistence of GFP transgene expression is compared 
between infected xenografts at 1, 3, 5, and 8 weeks post-transplantation. Epithelia is 
also harvested for Hirt DNA analysis by perfusion of xenografts with SDS 
containing Hirt extraction buffer. Hirt DNA Southern blots using EBNA and GFP 
probes assess the extent of episomal persistence of monomer and concatamer 
circular genomes over time. 

An increase in the persistence of EBNA-1 hybridizing Hirt DNA in cells 
infected with the EBNA-1 vector alone, as compared to GFP-probed Hirt Southerns 
of cells infected with only the GFP encoding rAAV construct, is observed as 
monomer circularized EBNA-1 vectors are the predominant form of autonomously 
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replicating episomes. However, when cells are infected with both vectors 
simultaneously, GFP transgene expression is prolonged. Molecular analyses are 
performed to characterize the autonomously replicating circular chimeras encoding 
both GFP and EBNA-1. 

This approach may be employed for gene therapy, as circular concatamers 
with as many as five genomes, have been observed. For example, for CFTR gene 
therapy, human bronchial xenografts and the RSV-EBNA-1 rAAV vector and a 
second CFTR rAAV vector with a minimal promoter may be employed. If 
genomes are oriented in a head-to-head fashion, the RS V enhancer may increase 
transcription from the minimal CFTR promoter. 
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